First Page | Document Content | |
---|---|---|
Date: 2011-12-02 21:34:52. Topics: Reinforcement learning, Q-learning, Multi-armed bandit, Statistics, SARSA, Normal distribution | Source URL: www.tokic.com. File size: 653,77 KB |