<--- Back to Details
First PageDocument Content
Econometrics / Statistical inference / Machine learning / Confidence interval / Multi-armed bandit / Thompson sampling / Reinforcement learning / Bayes estimator / Dimensional analysis / Statistics / Measurement / Estimation theory
Date: 2014-02-16 19:30:21
Econometrics
Statistical inference
Machine learning
Confidence interval
Multi-armed bandit
Thompson sampling
Reinforcement learning
Bayes estimator
Dimensional analysis
Statistics
Measurement
Estimation theory

Thompson Sampling for Complex Online Problems

Add to Reading List

Source URL: jmlr.org

Download Document from Source Website

File Size: 3,17 MB

Share Document on Facebook

Similar Documents

Computational neuroscience / Artificial intelligence / Machine learning algorithms / Neuroscience / Applied mathematics / Artificial neural networks / Reinforcement learning / Q-learning / Convolutional neural network / Distributed artificial intelligence / Deep learning / Intelligent agent

Cooperative Multi-Agent Control Using Deep Reinforcement Learning Jayesh K. Gupta Maxim Egorov

DocID: 1xVVh - View Document

Machine learning algorithms / Psychology / Neuroscience / Learning / Behaviorism / Belief revision / Reinforcement learning / Psychological manipulation / Reinforcement / Markov decision process / Intelligent agent

Distributed Computing Prof. R. Wattenhofer SA/MA: Byzantine Reinforcement Learning

DocID: 1xVKs - View Document

Artificial intelligence / Cybernetics / Neuroscience / Cognitive science / Computer vision / CAPTCHA / Computer security / Computational neuroscience / Machine learning / Turing test / Deep learning / Andrew Ng

Distributed Computing Prof. R. Wattenhofer Generating CAPTCHAs with Deep (Reinforcement) Learning

DocID: 1xV3l - View Document

Machine learning algorithms / Money / Finance / Artificial intelligence / Language acquisition / Richard S. Sutton / Reinforcement learning / Bootstrapping / Andrew Barto / Barto

Multi-step Bootstrapping Jennifer She Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto February 7, 2017

DocID: 1xUBi - View Document

Ambient intelligence / Internet of things / Long short-term memory / Scheduling / Future / Technology / Applied mathematics

Cellular Network Traffic Scheduling using Deep Reinforcement Learning Sandeep Chinchali, et. al. Marco Pavone, Sachin Katti Stanford University AAAI 2018

DocID: 1xUAT - View Document