Date: 2011-03-31 14:42:20

Machine learning, Multi-armed bandit, Stochastic optimization, Decision theory, Gittins index, Reinforcement learning, Bandit, Kullback–Leibler divergence, Probability distribution, Statistics, Design of experiments, Statistical theory