Q-learning

Results: 742



#Item
21Mathematics / Mathematical optimization / Dynamic programming / Mathematical analysis / Equations / Operations research / Systems theory / Stochastic control / Bellman equation / Markov decision process / Q-learning / Reinforcement learning

Increasing the Action Gap: New Operators for Reinforcement Learning Marc G. Bellemare and Georg Ostrovski and Arthur Guez Philip S. Thomas∗ and R´emi Munos Google DeepMind {bellemare,ostrovski,aguez,munos}@google.com;

Add to Reading List

Source URL: psthomas.com

Language: English - Date: 2015-12-12 00:05:18
22Cognitive science / Cognition / Artificial intelligence / Machine learning / Belief revision / Reinforcement learning / Temporal difference learning / Q-learning / Feature selection / Supervised learning / Proto-value functions / Action selection

Evolutionary Feature Evaluation for Online Reinforcement Learning

Add to Reading List

Source URL: eldar.mathstat.uoguelph.ca

Language: English - Date: 2016-07-12 12:05:04
23Artificial intelligence / Computer programming / Cognitive science / Software engineering / Predation / Automated planning and scheduling / Behavior / Q-learning / Behavior tree / Reinforcement learning / B-tree / Ethology

Paper Title (use style: paper title)

Add to Reading List

Source URL: eldar.mathstat.uoguelph.ca

Language: English - Date: 2016-07-12 12:05:04
24Algebra / Mathematics / Mathematical analysis / Image processing / Functional analysis / Linear filters / Feature detection / Filter theory / Image texture / Convolution / Distribution / Filter

LEARNING SIMPLE TEXTURE DISCRIMINATION FILTERS Rui F. C. Guerreiro Pedro M. Q. Aguiar Institute for Systems and Robotics, Instituto Superior T´ecnico

Add to Reading List

Source URL: users.isr.ist.utl.pt

Language: English - Date: 2010-06-29 10:21:08
25Artificial intelligence / Machine learning / Computational neuroscience / Learning / Applied mathematics / Artificial neural network / Mathematical psychology / Q-learning / Reinforcement learning / Supervised learning / Feature learning / Temporal difference learning

Continuous Deep Q-Learning with Model-based Acceleration arXiv:1603.00748v1 [cs.LG] 2 Mar 2016 Shixiang Gu1 2 3 SG 717@ CAM . AC . UK

Add to Reading List

Source URL: arxiv.org

Language: English - Date: 2016-03-02 20:31:58
26Cognitive science / Cognition / Dynamic programming / Markov processes / Stochastic control / Artificial intelligence / Belief revision / Reinforcement learning / Q-learning / Markov decision process / Action selection / Machine learning

Using Plan-Based Reward Shaping To Learn Strategies in StarCraft: Broodwar Kyriakos Efthymiadis Daniel Kudenko

Add to Reading List

Source URL: eldar.mathstat.uoguelph.ca

Language: English - Date: 2016-07-12 12:05:04
27Machine learning / Applied mathematics / Artificial intelligence / Computational neuroscience / Artificial neural networks / Computational statistics / Mathematical psychology / Deep learning / Reinforcement learning / Recurrent neural network / Q-learning / Speech recognition

Benchmarking Deep Reinforcement Learning for Continuous Control Yan Duan† ROCKYDUAN @ EECS . BERKELEY. EDU Xi Chen† C . XI @ EECS . BERKELEY. EDU

Add to Reading List

Source URL: jmlr.org

Language: English - Date: 2016-07-20 01:41:07
28Scoutcraft / Outdoor recreation / Learning / Leisure / Mountaineering / Backpacking / Ulaanbaatar / Camping / Mongolia / Q / Hiking / Backpack

+Mountaineering in Mongolia June 23 – July 13, 2016 LOCATION: Mongolia GUIDE: Helen Sovdat CAMP MANAGER: Brenda Critchley

Add to Reading List

Source URL: www.alpineclubofcanada.ca

Language: English - Date: 2015-05-28 13:34:05
29Learning to read / Language / Lexicography / Vocabulary / Quality assurance / Thesauri / Controlled vocabularies

BOOKS ABOUT QUALITY VOCABULARY Cityhalllosangeles.com QUALITY VOCABULARY

Add to Reading List

Source URL: q.cityhalllosangeles.com

Language: English - Date: 2015-03-05 01:53:23
30Belief revision / Reinforcement learning / Q-learning / Apprenticeship learning / Dynamic programming / Machine learning / Algorithm / Robotics / Support vector machine

SWIRL: A Sequential Windowed Inverse Reinforcement Learning Algorithm for Robot Tasks With Delayed Rewards Sanjay Krishnan, Animesh Garg, Richard Liaw, Brijen Thananjeyan, Lauren Miller, Florian T. Pokorny∗ , Ken Goldb

Add to Reading List

Source URL: goldberg.berkeley.edu

Language: English - Date: 2016-07-21 11:29:30
UPDATE