Scholars@Duke publication: Approximations to stochastic dynamic programs via information relaxation duality

Approximations to stochastic dynamic programs via information relaxation duality

Publication , Journal Article

Balseiro, SR; Brown, DB

Published in: Operations Research

January 1, 2019

In the analysis of complex stochastic dynamic programs, we often seek strong theoretical guarantees on the suboptimality of heuristic policies. One technique for obtaining performance bounds is perfect information analysis: this approach provides bounds on the performance of an optimal policy by considering a decision maker who has access to the outcomes of all future uncertainties before making decisions, that is, fully relaxed nonanticipativity constraints. A limitation of this approach is that in many problems perfect information about uncertainties is quite valuable, and thus, the resulting bound is weak. In this paper, we use an information relaxation duality approach, which includes a penalty that punishes violations of the nonanticipativity constraints, to derive stronger analytical bounds on the suboptimality of heuristic policies in stochastic dynamic programs that are too difficult to solve. The general framework we develop ties the heuristic policy and the performance bound together explicitly through the use of an approximate value function: heuristic policies are greedy with respect to this approximation, and penalties are also generated in a specific way using this approximation. We apply this approach to three challenging problems: stochastic knapsack problems, stochastic scheduling on parallel machines, and sequential search problems. In each of these problems, we consider a greedy heuristic policy generated by an approximate value function and a corresponding penalized perfect information bound. We then characterize the gap between the performance of the policy and the information relaxation bound in each problem; the results imply asymptotic optimality of the heuristic policy for specific “large” regimes of interest.

Duke Scholars

Author David B. Brown Fuqua School of Business

Published In

Operations Research

DOI

10.1287/opre.2018.1782

EISSN

1526-5463

ISSN

0030-364X

Publication Date

January 1, 2019

Volume

Issue

Start / End Page

577 / 597

Related Subject Headings

Operations Research
3507 Strategy, management and organisational behaviour
1503 Business and Management
0802 Computation Theory and Mathematics
0102 Applied Mathematics

Citation

APA

Chicago

ICMJE

MLA

NLM

Balseiro, S. R., & Brown, D. B. (2019). Approximations to stochastic dynamic programs via information relaxation duality. Operations Research, 67(2), 577–597. https://doi.org/10.1287/opre.2018.1782

Balseiro, S. R., and D. B. Brown. “Approximations to stochastic dynamic programs via information relaxation duality.” Operations Research 67, no. 2 (January 1, 2019): 577–97. https://doi.org/10.1287/opre.2018.1782.

Balseiro SR, Brown DB. Approximations to stochastic dynamic programs via information relaxation duality. Operations Research. 2019 Jan 1;67(2):577–97.

Balseiro, S. R., and D. B. Brown. “Approximations to stochastic dynamic programs via information relaxation duality.” Operations Research, vol. 67, no. 2, Jan. 2019, pp. 577–97. Scopus, doi:10.1287/opre.2018.1782.

Balseiro SR, Brown DB. Approximations to stochastic dynamic programs via information relaxation duality. Operations Research. 2019 Jan 1;67(2):577–597.

Published In

Operations Research

DOI

10.1287/opre.2018.1782

EISSN

1526-5463

ISSN

0030-364X

Publication Date

January 1, 2019

Volume

Issue

Start / End Page

577 / 597

Related Subject Headings

Operations Research
3507 Strategy, management and organisational behaviour
1503 Business and Management
0802 Computation Theory and Mathematics
0102 Applied Mathematics