Deep Reinforcement Learning Based Active Network Management and Emergency Load-Shedding Control for Power Systems

장호천; Sun, Xinfeng; Lee, Myoung Hoon; Moon, Jun

doi:10.1109/TSG.2023.3302846

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Deep Reinforcement Learning Based Active Network Management and Emergency Load-Shedding Control for Power SystemsDeep Reinforcement Learning-Based Active Network Management and Emergency Load-Shedding Control for Power Systems

Other Titles: Deep Reinforcement Learning-Based Active Network Management and Emergency Load-Shedding Control for Power Systems

Authors: 장호천; Sun, Xinfeng; Lee, Myoung Hoon; Moon, Jun

Issue Date: Mar-2024

Publisher: Institute of Electrical and Electronics Engineers

Keywords: active network management; Deep reinforcement learning; emergency control; Inference algorithms; load shedding; Power system stability; Power systems; safe reinforcement learning; Safety; Task analysis; Training; Voltage control

Citation: IEEE Transactions on Smart Grid, v.15, no.2, pp 1423 - 1437

Pages: 15

Indexed: SCIE
SCOPUS

Journal Title: IEEE Transactions on Smart Grid

Volume: 15

Number: 2

Start Page: 1423

End Page: 1437

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/195141

DOI: 10.1109/TSG.2023.3302846

ISSN: 1949-3053
1949-3061

Abstract: This paper presents two novel deep reinforcement learning (DRL) approaches aimed at solving complex power system control problems in a data-driven sense to maintain the stability of power systems. Specifically, we propose, respectively, SACPER (Soft Actor-Critic (SAC) with Prioritized Experience Replay (PER)) and Constrained Variational Policy Optimization (CVPO) DRL algorithms to address the sequential decision-making problem of active network management (ANM) in distributed power systems and optimizing emergency load shedding (ELS) control problems. First, we propose SACPER for the ANM problem, which prioritizes the training of samples with large errors and poor policy performance. Evaluation of SACPER in terms of stability improvement and convergence speed shows that the ANM problem is optimized and energy loss and operational constraint violations are minimized. Next, we introduce CVPO for the ELS control problem, which is formulated as the Safe Reinforcement Learning (SRL) framework to address safety constraint prioritization issues in power systems. We consider additional voltage variables in the network as strong constraints for SRL to achieve fast voltage recovery and minimize unnecessary energy loss, while ensuring good training performance and efficiency. To demonstrate the performances of SACPER, we apply it to ANM6-Easy environment. The CVPO algorithm is applied to IEEE 39-Bus and IEEE 300-Bus systems. The simulation results of SACPER and CVPO are validated through extensive comparisons with other state-of-the-art DRL approaches.

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 전기공학전공 > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Moon, Jun photo

Moon, Jun: COLLEGE OF ENGINEERING (MAJOR IN ELECTRICAL ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE