Hidden parameter Markov decision processes: A semiparametric regression approach for discovering latent task parametrizations
Publication, Conference
Doshi-Velez, F; Konidaris, G
Published in: IJCAI International Joint Conference on Artificial Intelligence
January 1, 2016
Control applications often feature tasks with similar, but not identical, dynamics. We introduce the Hidden Parameter Markov Decision Process (HiP-MDP), a framework that parametrizes a family of related dynamical systems with a low-dimensional set of latent factors, and present a semiparametric regression approach for learning its structure from data. We show that a learned HiP-MDP rapidly identifies the dynamics of new task instances in several settings, flexibly adapting to task variation.
Published In
IJCAI International Joint Conference on Artificial Intelligence
ISSN
1045-0823
Publication Date
January 1, 2016
Volume
2016-January
Start / End Page
1432 / 1440
Citation
APA
Doshi-Velez, F., & Konidaris, G. (2016). Hidden parameter Markov decision processes: A semiparametric regression approach for discovering latent task parametrizations. In IJCAI International Joint Conference on Artificial Intelligence (Vol. 2016-January, pp. 1432–1440).
Chicago
Doshi-Velez, F., and G. Konidaris. “Hidden parameter Markov decision processes: A semiparametric regression approach for discovering latent task parametrizations.” In IJCAI International Joint Conference on Artificial Intelligence, 2016-January:1432–40, 2016.
ICMJE
Doshi-Velez F, Konidaris G. Hidden parameter Markov decision processes: A semiparametric regression approach for discovering latent task parametrizations. In: IJCAI International Joint Conference on Artificial Intelligence. 2016. p. 1432–40.
MLA
Doshi-Velez, F., and G. Konidaris. “Hidden parameter Markov decision processes: A semiparametric regression approach for discovering latent task parametrizations.” IJCAI International Joint Conference on Artificial Intelligence, vol. 2016-January, 2016, pp. 1432–40.
NLM
Doshi-Velez F, Konidaris G. Hidden parameter Markov decision processes: A semiparametric regression approach for discovering latent task parametrizations. IJCAI International Joint Conference on Artificial Intelligence. 2016. p. 1432–1440.