Date: 2016-07-08 04:40:26
Topics: Dynamic programming, Equations, Stochastic control, Systems theory, Control theory, Systems science, Markov processes, Mathematics, Markov decision process, Mathematical optimization, Bellman equation, Reinforcement learning
Title: Approximate Policy Iteration for Markov Decision Processes via Quantitative Adaptive Aggregations
Authors: Alessandro Abate, Milan Češka, and Marta Kwiatkowska
Source URL: qav.comlab.ox.ac.uk
File Size: 190,84 KB