Bandit

Results: 280



#Item
51

JMLR: Workshop and Conference Proceedings vol–25th Annual Conference on Learning Theory Analysis of Thompson Sampling for the Multi-armed Bandit Problem Shipra Agrawal

Add to Reading List

Source URL: www.jmlr.org

Language: English - Date: 2012-06-17 06:50:47
    52

    JMLR: Workshop and Conference Proceedings vol 40:1–26, 2015 Regret Lower Bound and Optimal Algorithm in Dueling Bandit Problem Junpei Komiyama

    Add to Reading List

    Source URL: jmlr.org

    Language: English - Date: 2015-07-20 20:08:36
      53

      Scalable Discrete Sampling as a Multi-Armed Bandit Problem Yutian Chen Department of Engineering, University of Cambridge, Cambridge CB2 1PZ, UK YUTIAN . CHEN @ ENG . CAM . AC . UK

      Add to Reading List

      Source URL: www.cantab.net

      Language: English - Date: 2016-04-27 17:20:31
        54

        Non-Stochastic Bandit Slate Problems Satyen Kale Yahoo! Research Santa Clara, CA Lev Reyzin∗

        Add to Reading List

        Source URL: rob.schapire.net

        Language: English - Date: 2015-07-13 19:42:28
          55

          Journal of Machine Learning Research–1105 Submitted 2/05; Published 6/06 Action Elimination and Stopping Conditions for the Multi-Armed Bandit and Reinforcement Learning Problems∗

          Add to Reading List

          Source URL: www.jmlr.org

          Language: English - Date: 2006-06-22 16:25:37
            56

            APPENDIX – SUPPLEMENTARY MATERIAL Contextual Bandit Algorithms with Supervised Learning Guarantees Alina Beygelzimer IBM Research Hawthorne, NY

            Add to Reading List

            Source URL: jmlr.org

            Language: English
              57Natural language processing / Submodular set function / Machine learning / Recommender system / Algorithm / Information retrieval / Online algorithm / Multi-armed bandit / Automatic summarization

              Linear Submodular Bandits and their Application to Diversified Retrieval Carlos Guestrin Machine Learning Department Carnegie Mellon University

              Add to Reading List

              Source URL: select.cs.cmu.edu

              Language: English - Date: 2011-10-28 13:54:18
              58

              Optimization as Estimation with Gaussian Processes in Bandit Settings Anonymous Authors Affiliation

              Add to Reading List

              Source URL: bayesopt.github.io

              Language: English - Date: 2016-04-20 12:58:34
                59Statistics / Probability / Child development / Grasp / Reinforcement learning / Gittins index / Gaussian process / Sensitivity analysis / Monte Carlo integration / Probability distribution / Monte Carlo method / Sampling

                Multi-Armed Bandit Models for 2D Grasp Planning with Uncertainty Michael Laskey1 , Jeff Mahler1 , Zoe McCarthy1 , Florian T. Pokorny1 , Sachin Patil1 , Jur van den Berg4 , Danica Kragic3 , Pieter Abbeel1 , Ken Goldberg2

                Add to Reading List

                Source URL: goldberg.berkeley.edu

                Language: English - Date: 2015-08-31 02:12:22
                60

                Bayesian Incentive-Compatible Bandit Exploration∗ Yishay Mansour† Aleksandrs Slivkins‡ Vasilis Syrgkanis§

                Add to Reading List

                Source URL: arxiv.org

                Language: English - Date: 2015-10-27 06:12:36
                  UPDATE