Date: 2015-07-20 20:08:36Statistics Statistical theory Estimation theory Bayesian statistics Statistical inference Loss function Markov decision process Reinforcement learning Exponential family Confidence interval Likelihood function Conjugate prior | | JMLR: Workshop and Conference Proceedings vol 40:1–38, 2015 Thompson Sampling for Learning Parameterized Markov Decision Processes Aditya GopalanAdd to Reading ListSource URL: jmlr.orgDownload Document from Source Website File Size: 483,51 KBShare Document on Facebook
|