<--- Back to Details
First PageDocument Content
Multi-armed bandit / Stochastic optimization / Reinforcement learning / SARSA / Normal distribution / Temporal difference learning / Q-learning / Statistics / Computational neuroscience / Machine learning
Date: 2011-12-02 21:34:52
Multi-armed bandit
Stochastic optimization
Reinforcement learning
SARSA
Normal distribution
Temporal difference learning
Q-learning
Statistics
Computational neuroscience
Machine learning

Add to Reading List

Source URL: www.tokic.com

Download Document from Source Website

File Size: 426,05 KB

Share Document on Facebook

Similar Documents

Cognition / Cognitive science / Neuroscience / Artificial intelligence / Multi-agent systems / Belief revision / Reinforcement learning / Computational neuroscience / Sarsa / Intelligent agent / Reinforcement / Anticipation

REINFORCEMENT LEARNING FOR LIVE MUSICAL AGENTS Nick Collins University of Sussex ABSTRACT Current research programmes in computer music may

DocID: 1q53K - View Document

Experiments with SARSA Eric B Baum Dennis Horte Chick Markley Azure Sky Research Inc

DocID: 1m1N0 - View Document

Science / Reinforcement learning / Multi-agent system / Replicator equation / Q-learning / Agent-based model / SARSA / Dynamical system / Evolutionary game theory / Machine learning / Evolutionary dynamics / Artificial intelligence

Frequency Adjusted Multi-agent Q-learning Michael Kaisers and Karl Tuyls Maastricht University Maastricht, The Netherlands {michael.kaisers, k.tuyls} @maastrichtuniversity.nl

DocID: 19XZj - View Document

Computational neuroscience / Cybernetics / Reinforcement learning / Q-learning / Temporal difference learning / SARSA / Markov decision process / Unsupervised learning / Recurrent neural network / Machine learning / Neural networks / Statistics

Playing Atari with Deep Reinforcement Learning Volodymyr Mnih Koray Kavukcuoglu

DocID: 19Byv - View Document

Multi-agent systems / Reinforcement learning / International Conference on Autonomous Agents and Multiagent Systems / Intelligent agent / Agent-based model / Machine learning / SARSA / Affect / Action selection / Artificial intelligence / Science / Ethology

Multi-Agent, Reward Shaping for RoboCup KeepAway (Extended Abstract) Sam Devlin Marek Grze“s

DocID: 15DqP - View Document