Bandit

Results: 280



#Item
81

Markov models / Markov processes / Stochastic optimization / Mathematical optimization / Operations research / Reinforcement learning / Markov decision process / Algorithm / Multi-armed bandit / Dynamic programming / Shortest path problem / PP

Deterministic MDPs with Adversarial Rewards and Bandit Feedback. Raman Arora, TTIC, 6045 S. Kenwood Ave., Chicago, IL 60637, USA

Source URL: dept.stat.lsa.umich.edu

Language: English - Date: 2012-09-12 18:50:24
82

Mathematical optimization / Convex analysis / Operations research / Convex optimization / Machine learning / Self-concordant function / Interior point method / Ellipsoid method / Linear programming / Algorithm / Convex set / Stability

Interior-Point Methods for Full-Information and Bandit Online Learning. IEEE Transactions on Information Theory, vol. 58, no. 7, July 2012, p. 4164

Source URL: www-stat.wharton.upenn.edu

Language: English - Date: 2013-03-18 20:18:38
83

Mathematics / Mathematical analysis / Artificial intelligence / Backgammon / Rollout / Markov decision process / Multi-armed bandit / Reinforcement learning / Inverted pendulum / Pendulum / Prime-counting function / Valuation

Rollout Allocation Strategies for Classification-based Policy Iteration. Victor Gabillon, Alessandro Lazaric

Source URL: victorgabillon.nfshost.com

Language: English - Date: 2010-07-01 09:47:14
84

Statistics / Probability distributions / Statistical inference / Estimation theory / Machine learning / Stochastic optimization / Bayesian inference / Sampling / Normal distribution / Beta distribution / Multi-armed bandit / Confidence interval

An Empirical Evaluation of Thompson Sampling. Lihong Li, Yahoo! Research, Santa Clara, CA

Source URL: papers.nips.cc

Language: English - Date: 2014-02-24 03:34:34
85

Matching / Combinatorics / Game theory / Fellows of the Econometric Society / Cooperative games / Stable marriage problem / CC / Reinforcement learning / Multi-armed bandit / Alvin E. Roth / Greedy algorithm / Algorithm

Two-Sided Bandits and the Dating Market. Sanmay Das, Center for Biological and Computational Learning and Computer Science and Artificial Intelligence Lab, Massachusetts Institute of Technology, Cambridge, MA 02139

Source URL: faculty.chicagobooth.edu

Language: English - Date: 2006-08-08 10:56:19
86

Multi-armed Bandit Problems with History. Pannagadatta Shivaswamy and Thorsten Joachims, Department of Computer Science, Cornell University, Ithaca NY, {pannaga,tj}@cs.cornell.edu

Source URL: snowbird.djvuzone.org

Language: English - Date: 2011-02-10 15:51:00
87

The Blinded Bandit: Learning with Adaptive Feedback. Ofer Dekel, Microsoft Research

Source URL: tx.technion.ac.il

Language: English - Date: 2016-03-06 01:23:31
88

Multi-armed bandit experiments in the online service economy. Steven L. Scott, December 20, 2014. Abstract: The modern service economy is substantively different from the agricultural and manufacturing economies that precede

Source URL: faculty.chicagobooth.edu

Language: English - Date: 2015-01-20 12:35:42
89

Bandit-Based Monte-Carlo Structure Learning of Probabilistic Logic Programs (Machine Learning manuscript). Nicola Di Mauro · Elena Bellodi · Fabrizio

Source URL: ds.ing.unife.it

Language: English - Date: 2015-05-28 03:57:19
90

Bandit Smooth Convex Optimization: Improving the Bias-Variance Tradeoff. Ofer Dekel, Microsoft Research, Redmond, WA

Source URL: tx.technion.ac.il

Language: English - Date: 2016-03-06 01:23:30