First Page | Document Content | |
---|---|---|
![]() Date: 2008-03-25 15:09:21Stochastic control Kullback–Leibler divergence Reinforcement learning Statistics Dynamic programming Markov decision process | Source URL: books.nips.ccDownload Document from Source WebsiteFile Size: 160,45 KBShare Document on Facebook |