Option-Based Deep Reinforcement Learning for Topology Control of Power Systems

Zhang, Haotian; Wang, Chen; Lee, Myoung Hoon; Moon, Jun

doi:10.1109/ACCESS.2025.3539770

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Option-Based Deep Reinforcement Learning for Topology Control of Power Systemsopen access

Authors: Zhang, Haotian; Wang, Chen; Lee, Myoung Hoon; Moon, Jun

Issue Date: Feb-2025

Publisher: Institute of Electrical and Electronics Engineers Inc.

Keywords: Power system stability; Topology; Long short term memory; Control systems; Power systems; Decision making; Optimization; Network topology; Feature extraction; Accuracy; Deep reinforcement learning; option-critic framework; topology control; smart grid

Citation: IEEE Access, v.13, pp 26639 - 26650

Pages: 12

Indexed: SCIE
SCOPUS

Journal Title: IEEE Access

Volume: 13

Start Page: 26639

End Page: 26650

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/206695

DOI: 10.1109/ACCESS.2025.3539770

ISSN: 2169-3536
2169-3536

Abstract: In this paper, we propose an option-based deep reinforcement learning (DRL) algorithm called option-critic with long short-term memory (OC-LSTM), which combines the option-critic (OC) framework containing a hierarchical policy structure with a long short-term memory (LSTM) network, which makes full use of the powerful time-series feature extraction capability of LSTM networks and uses the OC framework to learn power system topology control policy of the power system. Specifically, in a complex and variable power system, the OC-LSTM extracts key power system state information through the LSTM network and uses the OC framework to define and optimize the high-level options and low-level action policy, which effectively reduces the dimensionality of the agent's topology control action space in the decision-making process. This combination improves the accuracy of topology control policies and effectively maintains the stability of the power system. The experimental results show that the OC-LSTM algorithm outperforms the benchmark DRL algorithm during training, with the ablation experiment further highlighting the effectiveness of LSTM in power system feature extraction. Additionally, the OC-LSTM algorithm enables stable operation of the IEEE 5-Bus, IEEE 14-Bus, and L2RPN WCCI 2020 power systems for 60 hours, all without human expert intervention.

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 전기공학전공 > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Moon, Jun photo

Moon, Jun: COLLEGE OF ENGINEERING (MAJOR IN ELECTRICAL ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE