Option-Based Deep Reinforcement Learning for Topology Control of Power Systemsopen access
- Authors
- Zhang, Haotian; Wang, Chen; Lee, Myoung Hoon; Moon, Jun
- Issue Date
- Feb-2025
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- Power system stability; Topology; Long short term memory; Control systems; Power systems; Decision making; Optimization; Network topology; Feature extraction; Accuracy; Deep reinforcement learning; option-critic framework; topology control; smart grid
- Citation
- IEEE Access, v.13, pp 26639 - 26650
- Pages
- 12
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE Access
- Volume
- 13
- Start Page
- 26639
- End Page
- 26650
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206695
- DOI
- 10.1109/ACCESS.2025.3539770
- ISSN
- 2169-3536
2169-3536
- Abstract
- In this paper, we propose an option-based deep reinforcement learning (DRL) algorithm called option-critic with long short-term memory (OC-LSTM), which combines the option-critic (OC) framework containing a hierarchical policy structure with a long short-term memory (LSTM) network, which makes full use of the powerful time-series feature extraction capability of LSTM networks and uses the OC framework to learn power system topology control policy of the power system. Specifically, in a complex and variable power system, the OC-LSTM extracts key power system state information through the LSTM network and uses the OC framework to define and optimize the high-level options and low-level action policy, which effectively reduces the dimensionality of the agent's topology control action space in the decision-making process. This combination improves the accuracy of topology control policies and effectively maintains the stability of the power system. The experimental results show that the OC-LSTM algorithm outperforms the benchmark DRL algorithm during training, with the ablation experiment further highlighting the effectiveness of LSTM in power system feature extraction. Additionally, the OC-LSTM algorithm enables stable operation of the IEEE 5-Bus, IEEE 14-Bus, and L2RPN WCCI 2020 power systems for 60 hours, all without human expert intervention.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 전기공학전공 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.