Skip to main content

Model-Free Reinforcement Learning for Stochastic Games with Linear Temporal Logic Objectives

Publication ,  Journal Article
Bozkurt, AK; Wang, Y; Zavlanos, MM; Pajic, M
Published in: Proceedings - IEEE International Conference on Robotics and Automation
January 1, 2021

We study synthesis of control strategies from linear temporal logic (LTL) objectives in unknown environments. We model this problem as a turn-based zero-sum stochastic game between the controller and the environment, where the transition probabilities and the model topology are fully unknown. The winning condition for the controller in this game is the satisfaction of the given LTL specification, which can be captured by the acceptance condition of a deterministic Rabin automaton (DRA) directly derived from the LTL specification. We introduce a model-free reinforcement learning (RL) methodology to find a strategy that maximizes the probability of satisfying a given LTL specification when the Rabin condition of the derived DRA has a single accepting pair. We then generalize this approach to any LTL formulas, for which the Rabin accepting condition may have more than one pairs, providing a lower bound on the satisfaction probability. Finally, we show applicability of our RL method on two planning case studies.

Duke Scholars

Published In

Proceedings - IEEE International Conference on Robotics and Automation

DOI

ISSN

1050-4729

Publication Date

January 1, 2021

Volume

2021-May

Start / End Page

10649 / 10655
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Bozkurt, A. K., Wang, Y., Zavlanos, M. M., & Pajic, M. (2021). Model-Free Reinforcement Learning for Stochastic Games with Linear Temporal Logic Objectives. Proceedings - IEEE International Conference on Robotics and Automation, 2021-May, 10649–10655. https://doi.org/10.1109/ICRA48506.2021.9561989
Bozkurt, A. K., Y. Wang, M. M. Zavlanos, and M. Pajic. “Model-Free Reinforcement Learning for Stochastic Games with Linear Temporal Logic Objectives.” Proceedings - IEEE International Conference on Robotics and Automation 2021-May (January 1, 2021): 10649–55. https://doi.org/10.1109/ICRA48506.2021.9561989.
Bozkurt AK, Wang Y, Zavlanos MM, Pajic M. Model-Free Reinforcement Learning for Stochastic Games with Linear Temporal Logic Objectives. Proceedings - IEEE International Conference on Robotics and Automation. 2021 Jan 1;2021-May:10649–55.
Bozkurt, A. K., et al. “Model-Free Reinforcement Learning for Stochastic Games with Linear Temporal Logic Objectives.” Proceedings - IEEE International Conference on Robotics and Automation, vol. 2021-May, Jan. 2021, pp. 10649–55. Scopus, doi:10.1109/ICRA48506.2021.9561989.
Bozkurt AK, Wang Y, Zavlanos MM, Pajic M. Model-Free Reinforcement Learning for Stochastic Games with Linear Temporal Logic Objectives. Proceedings - IEEE International Conference on Robotics and Automation. 2021 Jan 1;2021-May:10649–10655.

Published In

Proceedings - IEEE International Conference on Robotics and Automation

DOI

ISSN

1050-4729

Publication Date

January 1, 2021

Volume

2021-May

Start / End Page

10649 / 10655