Bandit - PDFSEARCH.IO - Document Search Engine

Bandit
Results: 280

#	Item
81	Deterministic MDPs with Adversarial Rewards and Bandit Feedback Raman Arora TTIC 6045 S. Kenwood Ave. Chicago, IL 60637, USA Add to Reading List Source URL: dept.stat.lsa.umich.edu Language: English - Date: 2012-09-12 18:50:24 Markov models Markov processes Stochastic optimization Mathematical optimization Operations research Reinforcement learning Markov decision process Algorithm Multi-armed bandit Dynamic programming Shortest path problem PP
82	4164 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 58, NO. 7, JULY 2012 Interior-Point Methods for Full-Information and Bandit Online Learning Add to Reading List Source URL: www-stat.wharton.upenn.edu Language: English - Date: 2013-03-18 20:18:38 Mathematical optimization Convex analysis Operations research Convex optimization Machine learning Self-concordant function Interior point method Ellipsoid method Linear programming Algorithm Convex set Stability
83	Rollout Allocation Strategies for Classification-based Policy Iteration Victor Gabillon Alessandro Lazaric Add to Reading List Source URL: victorgabillon.nfshost.com Language: English - Date: 2010-07-01 09:47:14 Mathematics Mathematical analysis Artificial intelligence Backgammon Rollout Markov decision process Multi-armed bandit Reinforcement learning Inverted pendulum Pendulum Prime-counting function Valuation
84	An Empirical Evaluation of Thompson Sampling Lihong Li Yahoo! Research Santa Clara, CA Add to Reading List Source URL: papers.nips.cc Language: English - Date: 2014-02-24 03:34:34 Statistics Probability distributions Statistical inference Estimation theory Machine learning Stochastic optimization Bayesian inference Sampling Normal distribution Beta distribution Multi-armed bandit Confidence interval
85	Two-Sided Bandits and the Dating Market Sanmay Das Center for Biological and Computational Learning and Computer Science and Artificial Intelligence Lab Massachusetts Institute of Technology Cambridge, MA 02139 Add to Reading List Source URL: faculty.chicagobooth.edu Language: English - Date: 2006-08-08 10:56:19 Matching Combinatorics Game theory Fellows of the Econometric Society Cooperative games Stable marriage problem CC Reinforcement learning Multi-armed bandit Alvin E. Roth Greedy algorithm Algorithm
86	Multi-armed Bandit Problems with History Pannagadatta Shivaswamy and Thorsten Joachims Department of Computer Science, Cornell University, Ithaca NY {pannaga,tj}@cs.cornell.edu 1 Add to Reading List Source URL: snowbird.djvuzone.org Language: English - Date: 2011-02-10 15:51:00
87	The Blinded Bandit: Learning with Adaptive Feedback Ofer Dekel Microsoft Research Add to Reading List Source URL: tx.technion.ac.il Language: English - Date: 2016-03-06 01:23:31
88	Multi-armed bandit experiments in the online service economy Steven L. Scott December 20, 2014 Abstract The modern service economy is substantively different from the agricultural and manufacturing economies that precede Add to Reading List Source URL: faculty.chicagobooth.edu Language: English - Date: 2015-01-20 12:35:42
89	Machine Learning manuscript No. (will be inserted by the editor) Bandit-Based Monte-Carlo Structure Learning of Probabilistic Logic Programs Nicola Di Mauro · Elena Bellodi · Fabrizio Add to Reading List Source URL: ds.ing.unife.it Language: English - Date: 2015-05-28 03:57:19
90	Bandit Smooth Convex Optimization: Improving the Bias-Variance Tradeoff Ofer Dekel Microsoft Research Redmond, WA Add to Reading List Source URL: tx.technion.ac.il Language: English - Date: 2016-03-06 01:23:30

UPDATE