Skip to main content

Optimal Batched Best Arm Identification

Publication ,  Conference
Jin, T; Yang, Y; Tang, J; Xiao, X; Xu, P
Published in: Advances in Neural Information Processing Systems
January 1, 2024

We study the batched best arm identification (BBAI) problem, where the learner's goal is to identify the best arm while switching the policy as less as possible. In particular, we aim to find the best arm with probability 1−δ for some small constant δ > 0 while minimizing both the sample complexity (total number of arm pulls) and the batch complexity (total number of batches). We propose the three-batch best arm identification (Tri-BBAI) algorithm, which is the first batched algorithm that achieves the optimal sample complexity in the asymptotic setting (i.e., δ → 0) and runs in 3 batches in expectation. Based on Tri-BBAI, we further propose the almost optimal batched best arm identification (Opt-BBAI) algorithm, which is the first algorithm that achieves the near-optimal sample and batch complexity in the non-asymptotic setting (i.e., δ is finite), while enjoying the same batch and sample complexity as Tri-BBAI when δ tends to zero. Moreover, in the non-asymptotic setting, the complexity of previous batch algorithms is usually conditioned on the event that the best arm is returned (with a probability of at least 1 − δ), which is potentially unbounded in cases where a sub-optimal arm is returned. In contrast, the complexity of Opt-BBAI does not rely on such an event. This is achieved through a novel procedure that we design for checking whether the best arm is eliminated, which is of independent interest.

Duke Scholars

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2024

Volume

37

Related Subject Headings

  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 1701 Psychology
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Jin, T., Yang, Y., Tang, J., Xiao, X., & Xu, P. (2024). Optimal Batched Best Arm Identification. In Advances in Neural Information Processing Systems (Vol. 37).
Jin, T., Y. Yang, J. Tang, X. Xiao, and P. Xu. “Optimal Batched Best Arm Identification.” In Advances in Neural Information Processing Systems, Vol. 37, 2024.
Jin T, Yang Y, Tang J, Xiao X, Xu P. Optimal Batched Best Arm Identification. In: Advances in Neural Information Processing Systems. 2024.
Jin, T., et al. “Optimal Batched Best Arm Identification.” Advances in Neural Information Processing Systems, vol. 37, 2024.
Jin T, Yang Y, Tang J, Xiao X, Xu P. Optimal Batched Best Arm Identification. Advances in Neural Information Processing Systems. 2024.

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2024

Volume

37

Related Subject Headings

  • 4611 Machine learning
  • 1702 Cognitive Sciences
  • 1701 Psychology