Back to Results
First PageMeta Content
Stochastic control / Kullback–Leibler divergence / Reinforcement learning / Statistics / Dynamic programming / Markov decision process


Document Date: 2008-03-25 15:09:21


Open Document

File Size: 160,45 KB

Share Result on Facebook

City

Brafman / New York / Cambridge / /

Company

ACM Press / MIT Press / John Wiley and Sons / Computer Sciences / /

Country

United States / /

/

Facility

Statistics University of California / University of California at Berkeley / /

Holiday

Assumption / /

IndustryTerm

logarithmic regret algorithm / uncertain systems / polynomial time algorithm / /

Movie

From now on / /

Organization

University of California / Berkeley / University of California / Department of Electrical Engineering and Computer Sciences / Statistics University / MIT / /

Person

Peter L. Bartlett / /

Position

RT / first author / /

ProvinceOrState

Rhode Island / California / /

PublishedMedium

Journal of Machine Learning Research / Machine Learning / /

Technology

Machine Learning / simulation / artificial intelligence / optimistic LP algorithm / polynomial time algorithm / Burnetas-Katehakis algorithm / Optimistic Linear Programming algorithm / logarithmic regret algorithm / /

SocialTag