Thompson Sampling for Complex Online Problems

First Page		Document Content
Date: 2014-02-16 19:30:21 Econometrics Statistical inference Machine learning Confidence interval Multi-armed bandit Thompson sampling Reinforcement learning Bayes estimator Dimensional analysis Statistics Measurement Estimation theory		Thompson Sampling for Complex Online Problems Add to Reading List Source URL: jmlr.org Download Document from Source Website File Size: 3,17 MB Share Document on Facebook

	Cooperative Multi-Agent Control Using Deep Reinforcement Learning Jayesh K. Gupta Maxim Egorov DocID: 1xVVh - View Document
	Distributed Computing Prof. R. Wattenhofer SA/MA: Byzantine Reinforcement Learning DocID: 1xVKs - View Document
	Distributed Computing Prof. R. Wattenhofer Generating CAPTCHAs with Deep (Reinforcement) Learning DocID: 1xV3l - View Document
	Multi-step Bootstrapping Jennifer She Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto February 7, 2017 DocID: 1xUBi - View Document
	Cellular Network Traffic Scheduling using Deep Reinforcement Learning Sandeep Chinchali, et. al. Marco Pavone, Sachin Katti Stanford University AAAI 2018 DocID: 1xUAT - View Document