First Page | Document Content | |
---|---|---|
![]() Date: 2005-10-06 09:29:58Multi-armed bandit Reinforcement learning Normal distribution Central limit theorem Algorithm Markov decision process Statistics Stochastic optimization Machine learning | Source URL: bandit.sourceforge.netDownload Document from Source WebsiteFile Size: 496,24 KBShare Document on Facebook |