Robust and efficient transfer learning with hidden parameter Markov decision processes
Publication
, Conference
Killian, T; Daulton, S; Konidaris, G; Doshi-Velez, F
Published in: Advances in Neural Information Processing Systems
January 1, 2017
We introduce a new formulation of the Hidden Parameter Markov Decision Process (HiP-MDP), a framework for modeling families of related tasks using low-dimensional latent embeddings. Our new framework correctly models the joint uncertainty in the latent parameters and the state space. We also replace the original Gaussian Process-based model with a Bayesian Neural Network, enabling more scalable inference. Thus, we expand the scope of the HiP-MDP to applications with higher dimensions and more complex dynamics.
Duke Scholars
Published In
Advances in Neural Information Processing Systems
ISSN
1049-5258
Publication Date
January 1, 2017
Volume
2017-December
Start / End Page
6251 / 6262
Related Subject Headings
- 4611 Machine learning
- 1702 Cognitive Sciences
- 1701 Psychology
Citation
APA
Chicago
ICMJE
MLA
NLM
Killian, T., Daulton, S., Konidaris, G., & Doshi-Velez, F. (2017). Robust and efficient transfer learning with hidden parameter Markov decision processes. In Advances in Neural Information Processing Systems (Vol. 2017-December, pp. 6251–6262).
Killian, T., S. Daulton, G. Konidaris, and F. Doshi-Velez. “Robust and efficient transfer learning with hidden parameter Markov decision processes.” In Advances in Neural Information Processing Systems, 2017-December:6251–62, 2017.
Killian T, Daulton S, Konidaris G, Doshi-Velez F. Robust and efficient transfer learning with hidden parameter Markov decision processes. In: Advances in Neural Information Processing Systems. 2017. p. 6251–62.
Killian, T., et al. “Robust and efficient transfer learning with hidden parameter Markov decision processes.” Advances in Neural Information Processing Systems, vol. 2017-December, 2017, pp. 6251–62.
Killian T, Daulton S, Konidaris G, Doshi-Velez F. Robust and efficient transfer learning with hidden parameter Markov decision processes. Advances in Neural Information Processing Systems. 2017. p. 6251–6262.
Published In
Advances in Neural Information Processing Systems
ISSN
1049-5258
Publication Date
January 1, 2017
Volume
2017-December
Start / End Page
6251 / 6262
Related Subject Headings
- 4611 Machine learning
- 1702 Cognitive Sciences
- 1701 Psychology