Dynamic programming / Markov processes / Stochastic control / Belief revision / Reinforcement learning / Markov decision process / Probability distribution


Targeting Specific Distributions of Trajectories in MDPs
David L. Roberts, Mark J. Nelson, Charles L. Isbell, Jr., Michael Mateas, Michael L. Littman

