Q-learning

Results: 742



#Item
21 Increasing the Action Gap: New Operators for Reinforcement Learning. Marc G. Bellemare, Georg Ostrovski, Arthur Guez, Philip S. Thomas∗ and Rémi Munos. Google DeepMind, {bellemare,ostrovski,aguez,munos}@google.com

Source URL: psthomas.com

Language: English - Date: 2015-12-12 00:05:18
22 Evolutionary Feature Evaluation for Online Reinforcement Learning

Source URL: eldar.mathstat.uoguelph.ca

Language: English - Date: 2016-07-12 12:05:04
23 Paper Title (use style: paper title)

Source URL: eldar.mathstat.uoguelph.ca

Language: English - Date: 2016-07-12 12:05:04
24 LEARNING SIMPLE TEXTURE DISCRIMINATION FILTERS. Rui F. C. Guerreiro, Pedro M. Q. Aguiar. Institute for Systems and Robotics, Instituto Superior Técnico

Source URL: users.isr.ist.utl.pt

Language: English - Date: 2010-06-29 10:21:08
25 Continuous Deep Q-Learning with Model-based Acceleration. arXiv:1603.00748v1 [cs.LG], 2 Mar 2016. Shixiang Gu (sg717@cam.ac.uk)

Source URL: arxiv.org

Language: English - Date: 2016-03-02 20:31:58
26 Using Plan-Based Reward Shaping To Learn Strategies in StarCraft: Broodwar. Kyriakos Efthymiadis, Daniel Kudenko

Source URL: eldar.mathstat.uoguelph.ca

Language: English - Date: 2016-07-12 12:05:04
27 Benchmarking Deep Reinforcement Learning for Continuous Control. Yan Duan† (rockyduan@eecs.berkeley.edu), Xi Chen† (c.xi@eecs.berkeley.edu)

Source URL: jmlr.org

Language: English - Date: 2016-07-20 01:41:07
28 +Mountaineering in Mongolia, June 23 – July 13, 2016. LOCATION: Mongolia. GUIDE: Helen Sovdat. CAMP MANAGER: Brenda Critchley

Source URL: www.alpineclubofcanada.ca

Language: English - Date: 2015-05-28 13:34:05
29 BOOKS ABOUT QUALITY VOCABULARY Cityhalllosangeles.com QUALITY VOCABULARY

Source URL: q.cityhalllosangeles.com

Language: English - Date: 2015-03-05 01:53:23
30 SWIRL: A Sequential Windowed Inverse Reinforcement Learning Algorithm for Robot Tasks With Delayed Rewards. Sanjay Krishnan, Animesh Garg, Richard Liaw, Brijen Thananjeyan, Lauren Miller, Florian T. Pokorny∗, Ken Goldb

Source URL: goldberg.berkeley.edu

Language: English - Date: 2016-07-21 11:29:30