Multi-armed bandit

Results: 113



#Item
1Online Learning for Personalized Room-Level Thermal Control: A Multi-Armed Bandit Framework Parisa Mansourifard Farrokh Jazizadeh

Online Learning for Personalized Room-Level Thermal Control: A Multi-Armed Bandit Framework Parisa Mansourifard Farrokh Jazizadeh

Add to Reading List

Source URL: anrg.usc.edu

Language: English - Date: 2013-12-06 19:17:18
    2Supplementary Material for ”Combinatorial multi-armed bandit: general framework, results and applications”, by Wei Chen, Yajun Wang, and Yang Yuan. A. Full proof of Theorem 1 We use the following two well known bound

    Supplementary Material for ”Combinatorial multi-armed bandit: general framework, results and applications”, by Wei Chen, Yajun Wang, and Yang Yuan. A. Full proof of Theorem 1 We use the following two well known bound

    Add to Reading List

    Source URL: proceedings.mlr.press

    Language: English - Date: 2018-07-16 03:38:06
      3Combinatorial Multi-Armed Bandit: General Framework, Results and Applications Wei Chen Microsoft Research Asia, Beijing, China

      Combinatorial Multi-Armed Bandit: General Framework, Results and Applications Wei Chen Microsoft Research Asia, Beijing, China

      Add to Reading List

      Source URL: proceedings.mlr.press

      Language: English - Date: 2018-07-16 03:38:06
        4THE NON-BAYESIAN RESTLESS MULTI-ARMED BANDIT: A CASE OF NEAR-LOGARITHMIC REGRET Wenhan Dai†∗ , Yi Gai‡ , Bhaskar Krishnamachari‡ , Qing Zhao§ †  School of Information Science and Technology, Tsinghua Universit

        THE NON-BAYESIAN RESTLESS MULTI-ARMED BANDIT: A CASE OF NEAR-LOGARITHMIC REGRET Wenhan Dai†∗ , Yi Gai‡ , Bhaskar Krishnamachari‡ , Qing Zhao§ † School of Information Science and Technology, Tsinghua Universit

        Add to Reading List

        Source URL: ceng.usc.edu

        Language: English - Date: 2011-10-16 14:12:22
          5Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards arXiv:1007.2238v2 [math.OC] 26 JulCem Tekin, Mingyan Liu

          Online Algorithms for the Multi-Armed Bandit Problem with Markovian Rewards arXiv:1007.2238v2 [math.OC] 26 JulCem Tekin, Mingyan Liu

          Add to Reading List

          Source URL: arxiv.org

          Language: English - Date: 2010-07-26 20:13:34
            6Dex-Net 1.0: A Cloud-Based Network of 3D Objects for Robust Grasp Planning Using a Multi-Armed Bandit Model with Correlated Rewards Jeffrey Mahler1 , Florian T. Pokorny1 , Brian Hou1 , Melrose Roderick1 , Michael Laskey1

            Dex-Net 1.0: A Cloud-Based Network of 3D Objects for Robust Grasp Planning Using a Multi-Armed Bandit Model with Correlated Rewards Jeffrey Mahler1 , Florian T. Pokorny1 , Brian Hou1 , Melrose Roderick1 , Michael Laskey1

            Add to Reading List

            Source URL: goldberg.berkeley.edu

            Language: English - Date: 2016-02-17 17:39:47
              7Stochastic Multi-Armed-Bandit Problem with Non-stationary Rewards Yonatan Gur Stanford University Stanford, CA

              Stochastic Multi-Armed-Bandit Problem with Non-stationary Rewards Yonatan Gur Stanford University Stanford, CA

              Add to Reading List

              Source URL: papers.nips.cc

              - Date: 2014-12-02 18:46:52
                8On the Combinatorial Multi-Armed Bandit Problem with Markovian Rewards Yi Gai∗ , Bhaskar Krishnamachari∗ and Mingyan Liu‡ ∗ ‡

                On the Combinatorial Multi-Armed Bandit Problem with Markovian Rewards Yi Gai∗ , Bhaskar Krishnamachari∗ and Mingyan Liu‡ ∗ ‡

                Add to Reading List

                Source URL: www-scf.usc.edu

                - Date: 2011-07-08 03:24:20
                  9THE NON-BAYESIAN RESTLESS MULTI-ARMED BANDIT: A CASE OF NEAR-LOGARITHMIC REGRET Wenhan Dai†∗ , Yi Gai‡ , Bhaskar Krishnamachari‡ , Qing Zhao§ †  School of Information Science and Technology, Tsinghua Universit

                  THE NON-BAYESIAN RESTLESS MULTI-ARMED BANDIT: A CASE OF NEAR-LOGARITHMIC REGRET Wenhan Dai†∗ , Yi Gai‡ , Bhaskar Krishnamachari‡ , Qing Zhao§ † School of Information Science and Technology, Tsinghua Universit

                  Add to Reading List

                  Source URL: www-scf.usc.edu

                  - Date: 2011-02-13 17:38:25
                    10Allocating Training Instances to Learning Agents that Improve Coordination for Team Formation Somchaya Liemhetcharat1 and Manuela Veloso2 1

                    Allocating Training Instances to Learning Agents that Improve Coordination for Team Formation Somchaya Liemhetcharat1 and Manuela Veloso2 1

                    Add to Reading List

                    Source URL: somchaya.org

                    Language: English - Date: 2014-09-25 23:11:01