Scholars@Duke publication: Integrated Online Learning and Adaptive Control in Queueing Systems with Uncertain Payoffs

Integrated Online Learning and Adaptive Control in Queueing Systems with Uncertain Payoffs

Publication , Journal Article

Hsu, WK; Xu, J; Lin, X; Bell, MR

Published in: Operations Research

March 1, 2022

We study task assignment in online service platforms, where unlabeled clients arrive according to a stochastic process and each client brings a random number of tasks. As tasks are assigned to servers, they produce client/server-dependent random payoffs. The goal of the system operator is to maximize the expected payoff per unit time subject to the servers’ capacity constraints. However, both the statistics of the dynamic client population and the client-specific payoff vectors are unknown to the operator. Thus, the operator must design task-assignment policies that integrate adaptive control (of the queueing system) with online learning (of the clients’ payoff vectors). A key challenge in such integration is how to account for the nontrivial closed-loop interactions between the queueing process and the learning process, which may significantly degrade system performance. We propose a new utility-guided online learning and task assignment algorithm that seamlessly integrates learning with control to address such difficulty. Our analysis shows that, compared with an oracle that knows all client dynamics and payoff vectors beforehand, the gap of the expected payoff per unit time of our proposed algorithm can be analytically bounded by three terms, which separately capture the impact of the client-dynamic uncertainty, client-server payoff uncertainty, and the loss incurred by backlogged clients in the system. Further, our bound holds for any finite time horizon. Through simulations, we show that our proposed algorithm significantly outperforms a myopic-matching policy and a standard queue-length-based policy that does not explicitly address the closed-loop interactions between queueing and learning.

Duke Scholars

Author Jiaming Xu Fuqua School of Business

Published In

Operations Research

DOI

10.1287/opre.2021.2100

EISSN

1526-5463

ISSN

0030-364X

Publication Date

March 1, 2022

Volume

Issue

Start / End Page

1166 / 1181

Related Subject Headings

Operations Research
3507 Strategy, management and organisational behaviour
1503 Business and Management
0802 Computation Theory and Mathematics
0102 Applied Mathematics

Citation

APA

Chicago

ICMJE

MLA

NLM

Hsu, W. K., Xu, J., Lin, X., & Bell, M. R. (2022). Integrated Online Learning and Adaptive Control in Queueing Systems with Uncertain Payoffs. Operations Research, 70(2), 1166–1181. https://doi.org/10.1287/opre.2021.2100

Hsu, W. K., J. Xu, X. Lin, and M. R. Bell. “Integrated Online Learning and Adaptive Control in Queueing Systems with Uncertain Payoffs.” Operations Research 70, no. 2 (March 1, 2022): 1166–81. https://doi.org/10.1287/opre.2021.2100.

Hsu WK, Xu J, Lin X, Bell MR. Integrated Online Learning and Adaptive Control in Queueing Systems with Uncertain Payoffs. Operations Research. 2022 Mar 1;70(2):1166–81.

Hsu, W. K., et al. “Integrated Online Learning and Adaptive Control in Queueing Systems with Uncertain Payoffs.” Operations Research, vol. 70, no. 2, Mar. 2022, pp. 1166–81. Scopus, doi:10.1287/opre.2021.2100.

Hsu WK, Xu J, Lin X, Bell MR. Integrated Online Learning and Adaptive Control in Queueing Systems with Uncertain Payoffs. Operations Research. 2022 Mar 1;70(2):1166–1181.

Published In

Operations Research

DOI

10.1287/opre.2021.2100

EISSN

1526-5463

ISSN

0030-364X

Publication Date

March 1, 2022

Volume

Issue

Start / End Page

1166 / 1181

Related Subject Headings

Operations Research
3507 Strategy, management and organisational behaviour
1503 Business and Management
0802 Computation Theory and Mathematics
0102 Applied Mathematics