Skip to main content

A central limit theorem for temporally nonhomogenous markov chains with applications to dynamic programming

Publication ,  Journal Article
Arlotto, A; Michael Steele, J
Published in: Mathematics of Operations Research
November 1, 2016

We prove a central limit theorem for a class of additive processes that arise naturally in the theory of finite horizon Markov decision problems. The main theorem generalizes a classic result of Dobrushin for temporally nonhomogeneous Markov chains, and the principal innovation is that here the summands are permitted to depend on both the current state and a bounded number of future states of the chain. We show through several examples that this added flexibility gives one a direct path to asymptotic normality of the optimal total reward of finite horizon Markov decision problems. The same examples also explain why such results are not easily obtained by alternative Markovian techniques such as enlargement of the state space.

Duke Scholars

Published In

Mathematics of Operations Research

DOI

EISSN

1526-5471

ISSN

0364-765X

Publication Date

November 1, 2016

Volume

41

Issue

4

Start / End Page

1448 / 1468

Related Subject Headings

  • Operations Research
  • 4901 Applied mathematics
  • 0802 Computation Theory and Mathematics
  • 0103 Numerical and Computational Mathematics
  • 0102 Applied Mathematics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Arlotto, A., & Michael Steele, J. (2016). A central limit theorem for temporally nonhomogenous markov chains with applications to dynamic programming. Mathematics of Operations Research, 41(4), 1448–1468. https://doi.org/10.1287/moor.2016.0784
Arlotto, A., and J. Michael Steele. “A central limit theorem for temporally nonhomogenous markov chains with applications to dynamic programming.” Mathematics of Operations Research 41, no. 4 (November 1, 2016): 1448–68. https://doi.org/10.1287/moor.2016.0784.
Arlotto A, Michael Steele J. A central limit theorem for temporally nonhomogenous markov chains with applications to dynamic programming. Mathematics of Operations Research. 2016 Nov 1;41(4):1448–68.
Arlotto, A., and J. Michael Steele. “A central limit theorem for temporally nonhomogenous markov chains with applications to dynamic programming.” Mathematics of Operations Research, vol. 41, no. 4, Nov. 2016, pp. 1448–68. Scopus, doi:10.1287/moor.2016.0784.
Arlotto A, Michael Steele J. A central limit theorem for temporally nonhomogenous markov chains with applications to dynamic programming. Mathematics of Operations Research. 2016 Nov 1;41(4):1448–1468.

Published In

Mathematics of Operations Research

DOI

EISSN

1526-5471

ISSN

0364-765X

Publication Date

November 1, 2016

Volume

41

Issue

4

Start / End Page

1448 / 1468

Related Subject Headings

  • Operations Research
  • 4901 Applied mathematics
  • 0802 Computation Theory and Mathematics
  • 0103 Numerical and Computational Mathematics
  • 0102 Applied Mathematics