Skip to main content

Hidden parameter markov decision processes: A semiparametric regression approach for discovering latent task parametrizations

Publication ,  Conference
Doshi-Velez, F; Konidaris, G
Published in: IJCAI International Joint Conference on Artificial Intelligence
January 1, 2016

Control applications often feature tasks with similar, but not identical, dynamics. We introduce the Hidden Parameter Markov Decision Process (HiPMDP), a framework that parametrizes a family of related dynamical systems with a low-dimensional set of latent factors, and introduce a semiparametric regression approach for learning its structure from data. We show that a learned HiP-MDP rapidly identifies the dynamics of new task instances in several settings, flexibly adapting to task variation.

Duke Scholars

Published In

IJCAI International Joint Conference on Artificial Intelligence

ISSN

1045-0823

Publication Date

January 1, 2016

Volume

2016-January

Start / End Page

1432 / 1440
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Doshi-Velez, F., & Konidaris, G. (2016). Hidden parameter markov decision processes: A semiparametric regression approach for discovering latent task parametrizations. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2016-January, pp. 1432–1440).
Doshi-Velez, F., and G. Konidaris. “Hidden parameter markov decision processes: A semiparametric regression approach for discovering latent task parametrizations.” In IJCAI International Joint Conference on Artificial Intelligence, 2016-January:1432–40, 2016.
Doshi-Velez F, Konidaris G. Hidden parameter markov decision processes: A semiparametric regression approach for discovering latent task parametrizations. In: IJCAI International Joint Conference on Artificial Intelligence. 2016. p. 1432–40.
Doshi-Velez, F., and G. Konidaris. “Hidden parameter markov decision processes: A semiparametric regression approach for discovering latent task parametrizations.” IJCAI International Joint Conference on Artificial Intelligence, vol. 2016-January, 2016, pp. 1432–40.
Doshi-Velez F, Konidaris G. Hidden parameter markov decision processes: A semiparametric regression approach for discovering latent task parametrizations. IJCAI International Joint Conference on Artificial Intelligence. 2016. p. 1432–1440.

Published In

IJCAI International Joint Conference on Artificial Intelligence

ISSN

1045-0823

Publication Date

January 1, 2016

Volume

2016-January

Start / End Page

1432 / 1440