SARSA

Results: 43



#Item
21. Markov chain / Q-learning / Artificial intelligence / Learning / Statistics / SARSA / Temporal difference learning

Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda. Carlton Downey, Victoria University of Wellington, Wellington, New Zealand

Source URL: users.cecs.anu.edu.au

Language: English - Date: 2010-05-05 13:10:11
22. SARSA / Q-learning / Reinforcement learning / Temporal difference learning / Machine learning / Algorithm / Apprenticeship learning / Spatial memory / Artificial intelligence / Learning / Mathematics

Learning to Follow Navigational Directions. Adam Vogel and Dan Jurafsky, Department of Computer Science, Stanford University, {acvogel,jurafsky}@stanford.edu

Source URL: nlp.stanford.edu

Language: English - Date: 2010-05-17 17:59:35
23. Stochastic control / SARSA / Markov models / Theoretical computer science / Reinforcement learning / Q-learning / Council on Environmental Quality / Temporal difference learning / Partially observable Markov decision process / Statistics / Markov processes / Dynamic programming

Consistent exploration improves convergence of reinforcement learning on POMDPs. Paul A. Crook, Gillian Hayes

Source URL: homepages.inf.ed.ac.uk

Language: English - Date: 2007-07-04 12:19:49
24. Cognitive architecture / Reinforcement learning / Q-learning / Temporal difference learning / Motivation / Action selection / Modularity / SARSA / ACT-R / Artificial intelligence / Behavior / Mind

LOGO_frontiersinpsychology

Source URL: www.cs.utexas.edu

Language: English - Date: 2013-03-19 13:42:21
25. Computational neuroscience / Cybernetics / Reinforcement learning / Q-learning / Temporal difference learning / SARSA / Markov decision process / Unsupervised learning / Recurrent neural network / Machine learning / Neural networks / Statistics

Playing Atari with Deep Reinforcement Learning. Volodymyr Mnih, Koray Kavukcuoglu

Source URL: arxiv.org

Language: English - Date: 2013-12-19 20:23:45
26. Statistics / Cybernetics / Computational statistics / Artificial intelligence / Learning / Q-learning / Artificial neural network / Reinforcement learning / SARSA / Computational neuroscience / Neural networks / Machine learning

A neural reinforcement learning model for tasks with unknown time delays. Daniel Rasmussen ([removed]), Chris Eliasmith ([removed]), Centre for Theoretical Neuroscience, University of Waterloo Wate

Source URL: mindmodeling.org

Language: English - Date: 2013-07-15 14:54:54
27. Thought / Machine learning / Apprenticeship learning / Computational neuroscience / Reinforcement learning / Abstraction / Q-learning / Ada / SARSA / Computing / Mathematics / Cognition

Automatic Task Decomposition and State Abstraction from Demonstration. Luis C. Cobo, Charles L. Isbell Jr.

Source URL: www.cc.gatech.edu

Language: English - Date: 2012-03-31 18:30:01
28. Q-learning / Markov decision process / Theoretical computer science / SARSA / Algorithm / Function / Statistics / Mathematics / Reinforcement learning

Object Focused Q-learning for Autonomous Agents. Luis C. Cobo, Charles L. Isbell Jr., Andrea L. Thomaz

Source URL: www.cc.gatech.edu

Language: English - Date: 2013-04-16 11:54:07
29. Stochastic control / Reinforcement learning / Computational neuroscience / Partially observable Markov decision process / Machine learning / Markov decision process / Q-learning / Self-reconfiguring modular robot / SARSA / Statistics / Dynamic programming / Markov processes

Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning. Paulina Varshavskaya, Leslie Pack Kaelbling and Daniela Rus, Computer Science and AI Laboratory, Massachusetts Institute of Technolog

Source URL: people.csail.mit.edu

Language: English - Date: 2007-09-07 17:21:08
30. Control theory / Linear filters / Stochastic differential equations / Kalman filter / Markov decision process / Normal distribution / Gaussian process / Q-learning / SARSA / Statistics / Markov models / Stochastic processes

Reinforcement learning with Gaussian processes. Yaakov Engel, Dept. of Computing Science, University of Alberta, Edmonton, Canada; Shie Mannor, Dept. of Electrical and Computer Engineering, McGill University, Montreal, Cana

Source URL: www.machinelearning.org

Language: English - Date: 2008-12-01 11:15:01