Skip to main content

Skill Discovery for Exploration and Planning using Deep Skill Graphs

Publication ,  Conference
Bagaria, A; Senthil, J; Konidaris, G
Published in: Proceedings of Machine Learning Research
January 1, 2021

We introduce a new skill-discovery algorithm that builds a discrete graph representation of large continuous MDPs, where nodes correspond to skill subgoals and the edges to skill policies. The agent constructs this graph during an unsupervised training phase where it interleaves discovering skills and planning using them to gain coverage over ever-increasing portions of the state-space. Given a novel goal at test time, the agent plans with the acquired skill graph to reach a nearby state, then switches to learning to reach the goal. We show that the resulting algorithm, Deep Skill Graphs, outperforms both flat and existing hierarchical reinforcement learning methods on four difficult continuous control tasks.

Published In

Proceedings of Machine Learning Research

EISSN

2640-3498

Publication Date

January 1, 2021

Volume

139

Start / End Page

521 / 531
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Bagaria, A., Senthil, J., & Konidaris, G. (2021). Skill Discovery for Exploration and Planning using Deep Skill Graphs. In Proceedings of Machine Learning Research (Vol. 139, pp. 521–531).
Bagaria, A., J. Senthil, and G. Konidaris. “Skill Discovery for Exploration and Planning using Deep Skill Graphs.” In Proceedings of Machine Learning Research, 139:521–31, 2021.
Bagaria A, Senthil J, Konidaris G. Skill Discovery for Exploration and Planning using Deep Skill Graphs. In: Proceedings of Machine Learning Research. 2021. p. 521–31.
Bagaria, A., et al. “Skill Discovery for Exploration and Planning using Deep Skill Graphs.” Proceedings of Machine Learning Research, vol. 139, 2021, pp. 521–31.
Bagaria A, Senthil J, Konidaris G. Skill Discovery for Exploration and Planning using Deep Skill Graphs. Proceedings of Machine Learning Research. 2021. p. 521–531.

Published In

Proceedings of Machine Learning Research

EISSN

2640-3498

Publication Date

January 1, 2021

Volume

139

Start / End Page

521 / 531