Scholars@Duke publication: Approximation algorithms for budgeted learning problems

Approximation algorithms for budgeted learning problems

Publication , Conference

Guha, S; Munagala, K

Published in: Proceedings of the Annual ACM Symposium on Theory of Computing

October 30, 2007

We present the first approximation algorithms for a large class of budgeted learning problems. One classicexample of the above is the budgeted multi-armed bandit problem. In this problem each arm of the bandithas an unknown reward distribution on which a prior isspecified as input. The knowledge about the underlying distribution can be refined in the exploration phase by playing the arm and observing the rewards. However, there is a budget on the total number of plays allowed during exploration. After this exploration phase,the arm with the highest (posterior) expected reward is hosen for exploitation. The goal is to design the adaptive exploration phase subject to a budget constraint on the number of plays, in order to maximize the expected reward of the arm chosen for exploitation. While this problem is reasonably well understood in the infinite horizon discounted reward setting, the budgeted version of the problem is NP-Hard. For this problem and several generalizations, we provide approximate policies that achieve a reward within constant factor of the reward optimal policy. Our algorithms use a novel linear program rounding technique based on stochastic packing. Copyright 2007 ACM.

Duke Scholars

Author Kamesh Munagala Computer Science

Published In

Proceedings of the Annual ACM Symposium on Theory of Computing

DOI

10.1145/1250790.1250807

ISSN

0737-8017

ISBN

9781595936318

Publication Date

October 30, 2007

Start / End Page

104 / 113

Citation

APA

Chicago

ICMJE

MLA

NLM

Guha, S., & Munagala, K. (2007). Approximation algorithms for budgeted learning problems. In Proceedings of the Annual ACM Symposium on Theory of Computing (pp. 104–113). https://doi.org/10.1145/1250790.1250807

Guha, S., and K. Munagala. “Approximation algorithms for budgeted learning problems.” In Proceedings of the Annual ACM Symposium on Theory of Computing, 104–13, 2007. https://doi.org/10.1145/1250790.1250807.

Guha S, Munagala K. Approximation algorithms for budgeted learning problems. In: Proceedings of the Annual ACM Symposium on Theory of Computing. 2007. p. 104–13.

Guha, S., and K. Munagala. “Approximation algorithms for budgeted learning problems.” Proceedings of the Annual ACM Symposium on Theory of Computing, 2007, pp. 104–13. Scopus, doi:10.1145/1250790.1250807.

Guha S, Munagala K. Approximation algorithms for budgeted learning problems. Proceedings of the Annual ACM Symposium on Theory of Computing. 2007. p. 104–113.

Published In

Proceedings of the Annual ACM Symposium on Theory of Computing

DOI

10.1145/1250790.1250807

ISSN

0737-8017

ISBN

9781595936318

Publication Date

October 30, 2007

Start / End Page

104 / 113