Skip to main content

Action Set Based Policy Optimization for Safe Power Grid Management

Publication ,  Conference
Zhou, B; Zeng, H; Liu, Y; Li, K; Wang, F; Tian, H
Published in: Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics
January 1, 2021

Maintaining the stability of the modern power grid is becoming increasingly difficult due to fluctuating power consumption, unstable power supply coming from renewable energies, and unpredictable accidents such as man-made and natural disasters. As the operation on the power grid must consider its impact on future stability, reinforcement learning (RL) has been employed to provide sequential decision-making in power grid management. However, existing methods have not considered the environmental constraints. As a result, the learned policy has risk of selecting actions that violate the constraints in emergencies, which will escalate the issue of overloaded power lines and lead to large-scale blackouts. In this work, we propose a novel method for this problem, which builds on top of the search-based planning algorithm. At the planning stage, the search space is limited to the action set produced by the policy. The selected action strictly follows the constraints by testing its outcome with the simulation function provided by the system. At the learning stage, to address the problem that gradients cannot be propagated to the policy, we introduce Evolutionary Strategies (ES) with black-box policy optimization to improve the policy directly, maximizing the returns of the long run. In NeurIPS 2020 Learning to Run Power Network (L2RPN) competition, our solution safely managed the power grid and ranked first in both tracks.

Duke Scholars

Published In

Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics

DOI

EISSN

1611-3349

ISSN

0302-9743

Publication Date

January 1, 2021

Volume

12979 LNAI

Start / End Page

168 / 181

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Zhou, B., Zeng, H., Liu, Y., Li, K., Wang, F., & Tian, H. (2021). Action Set Based Policy Optimization for Safe Power Grid Management. In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics (Vol. 12979 LNAI, pp. 168–181). https://doi.org/10.1007/978-3-030-86517-7_11
Zhou, B., H. Zeng, Y. Liu, K. Li, F. Wang, and H. Tian. “Action Set Based Policy Optimization for Safe Power Grid Management.” In Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, 12979 LNAI:168–81, 2021. https://doi.org/10.1007/978-3-030-86517-7_11.
Zhou B, Zeng H, Liu Y, Li K, Wang F, Tian H. Action Set Based Policy Optimization for Safe Power Grid Management. In: Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics. 2021. p. 168–81.
Zhou, B., et al. “Action Set Based Policy Optimization for Safe Power Grid Management.” Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics, vol. 12979 LNAI, 2021, pp. 168–81. Scopus, doi:10.1007/978-3-030-86517-7_11.
Zhou B, Zeng H, Liu Y, Li K, Wang F, Tian H. Action Set Based Policy Optimization for Safe Power Grid Management. Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics. 2021. p. 168–181.

Published In

Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics

DOI

EISSN

1611-3349

ISSN

0302-9743

Publication Date

January 1, 2021

Volume

12979 LNAI

Start / End Page

168 / 181

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences