基于深度强化学习的电力系统暂态稳定快关汽门紧急控制策略

doi:10.19783/j.cnki.pspc.241593

首页 > 过刊浏览>2025年第53卷第19期 >175-187. DOI:10.19783/j.cnki.pspc.241593

基于深度强化学习的电力系统暂态稳定快关汽门紧急控制策略
DOI:
                        10.19783/j.cnki.pspc.241593
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:1.现代电力系统仿真控制与绿色电能新技术教育部重点实验室(东北电力大学)，吉林 吉林 132012; 2.国网河北省电力有限公司衡水供电分公司，河北 衡水 053000
作者简介:
通讯作者:
中图分类号:
基金项目:国家自然科学基金项目资助(52277084)；吉林省国际科技合作项目资助(20230402074GH)

Fast valving emergency control strategy for power system transient stability based on deep reinforcement learning

Author:

Affiliation:

1. Key Laboratory of Modern Power System Simulation and Control & Renewable Energy Technology, Ministry of Education (Northeast Electric Power University), Jilin 132012, China; 2. Hengshui Power Supply Branch, State Grid Hebei Electric Power Co., Ltd., Hengshui 053000, China

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

快关汽门是提升电力系统暂态稳定性的经典控制方式之一，但其控制变量具有高维度、离散化的特点，且参数整定不合理将引发功角后续摇摆失稳，控制策略制定的复杂性致使快关汽门难以在线应用与实时决策。为此，提出了基于深度强化学习的快关汽门控制决策方法。首先，构建基于深度强化学习的紧急快关汽门决策制定框架。然后，将快关汽门控制问题转化为马尔可夫决策过程(Markov decision process, MDP)，以综合考虑最优稳定控制效果及最小化稳控代价为目标设置奖励函数，并采用近端策略优化(proximal policy optimization, PPO)算法求解，得到快关策略的合理配置。最后，通过改进的电科院SG-77系统验证所提方法的有效性。仿真结果表明所提方法在保证快关汽门策略有效性与时效性的同时，可实现在预案式失配场景下作出正确决策，提高了电力系统的暂态稳定性和动态响应能力。

Abstract:

Fast valving is one of the classic control methods to improve transient stability in power systems. However, its control variables are high-dimensional and discrete, and improper parameter tuning may trigger subsequent power angle oscillations and instability. The complexity of the control strategy development makes it difficult to apply fast valving closure in online and real-time decision-making. To address this challenge, a fast valving control decision method based on deep reinforcement learning is proposed. First, a deep reinforcement learning-based emergency fast valving decision-making framework is constructed. Then, the fast valving control problem is transformed into a Markov decision process (MDP). A reward function is designed to balance optimal stability control performance and minimized control cost, and the proximal policy optimization (PPO) algorithm is used to solve it, yielding a rational configuration of the fast valving strategy. Finally, the effectiveness of the proposed method is verified using the improved SG-77 system developed by CEPRI. Simulation results show that the proposed method ensures both the effectiveness and timeliness of the fast valving strategy, enabling correct decision-making under mismatched contingency scenarios, and improves transient stability and dynamic response capability of power systems.

参考文献

相似文献

引证文献

引用本文

孙正龙,陈威翰,耿鑫地,等.基于深度强化学习的电力系统暂态稳定快关汽门紧急控制策略[J].电力系统保护与控制,2025,53(19):175-187.[SUN Zhenglong, CHEN Weihan, GENG Xindi, et al. Fast valving emergency control strategy for power system transient stability based on deep reinforcement learning[J]. Power System Protection and Control,2025,V53(19):175-187]

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2024-11-29
最后修改日期:2025-06-05
录用日期:
在线发布日期: 2025-09-28
出版日期:

首页

期刊介绍

编委会

投稿指南

期刊影响力

广告与发行

企业展台

English

PCMP英文刊

新能源汽车供能技术

引用本文

分享

相关视频

文章指标

历史

文章二维码