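A deep reinforcement learning algorithm-based active safety correction method for power grids

Authors: SUN Lijun, GU Xueping, LIU Tong, et al.

Affiliations:

(1. School of Electrical and Electronic Engineering, North China Electric Power University, Baoding 071003, China; 2. State Grid Hebei Electric Power Co., Ltd., Shijiazhuang 050021, China)

About the authors:

SUN Lijun (born 1997), male, corresponding author, master's student. Research interests: artificial intelligence techniques and their application in power systems; power system security assessment and control. E-mail: does877@163.com

GU Xueping (born 1964), male, PhD, professor and doctoral supervisor. Research interests: power system security and stability assessment and control; power system security defense and restoration control; application of intelligent techniques in power systems. E-mail: xpgu@ncepu.edu.cn

LIU Tong (born 1996), female, PhD candidate. Research interests: artificial intelligence techniques and their application in power systems; power system security assessment and control. E-mail: tongliu_1996@163.com

Funding:

Science and Technology Project of State Grid Corporation of China (SGTYHT/17-JS-199)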

Abstract:

Active power security correction is essential to the secure operation of a power grid. Traditional correction methods cannot jointly account for the system power flow distribution and the regulation performance of the generating units. They solve slowly, involve many units, and tend to adjust the same units back and forth, which makes them hard to apply in practice. This paper therefore uses deep reinforcement learning and proposes an active power security correction strategy based on the deep Q network (DQN). First, a model of system active power security correction is established. Second, a convolutional neural network (CNN) is used to extract deep features of the grid operating state. The DQN algorithm then uses its "state-action" mechanism, with "reward" as the medium, to build a mapping from the grid operating state to the optimal combination of units to adjust, which determines the adjusting units. Finally, the adjustment amounts are calculated from the sensitivities of the overloaded lines to the adjusting units. Results on the IEEE 39-bus system show that, when handling multiple overloaded lines, the proposed strategy accounts for both the overall power flow distribution and the regulation performance of the units, and eliminates line overloads efficiently. This work is supported by the Science and Technology Project of State Grid Corporation of China (No. SGTYHT/17-JS-199).
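The final step of the abstract, computing the adjustment amount from line-to-unit sensitivities, can be illustrated with a small sketch. This is not the authors' formulation, which the abstract does not give. It assumes a balanced pairwise shift (one unit raised and one lowered by the same amount), line flows taken as positive in the loaded direction, and a hypothetical sensitivity matrix `sens[l, g]` giving the flow change on line `l` per MW injected at unit `g`. The function name, its parameters, and the numbers are all illustrative; in the paper's scheme the unit pair would come from the DQN rather than being passed in by hand.

```python
import numpy as np


def adjustment_amount(flows, limits, sens, up_unit, down_unit,
                      up_margin, down_margin):
    """Return the MW shifted from down_unit to up_unit (hypothetical helper).

    Assumptions: flows are positive in the loaded direction, sens[l, g] is
    the flow change on line l per MW injected at unit g, and the shift is
    balanced so total generation stays constant.
    """
    flows = np.asarray(flows, dtype=float)
    limits = np.asarray(limits, dtype=float)
    sens = np.asarray(sens, dtype=float)

    overload = flows - limits
    overloaded = overload > 0
    if not overloaded.any():
        return 0.0  # nothing to correct

    # Flow relief on each line per MW moved from down_unit to up_unit.
    relief = sens[:, down_unit] - sens[:, up_unit]
    if np.any(relief[overloaded] <= 0):
        raise ValueError("unit pair cannot relieve every overloaded line")

    # Smallest shift that clears the worst overloaded line, capped by the
    # remaining up/down regulation margins of the two units.
    required = np.max(overload[overloaded] / relief[overloaded])
    return float(min(required, up_margin, down_margin))


# Illustrative two-line, two-unit case (made-up numbers).
flows = [120.0, 80.0]            # MW
limits = [100.0, 100.0]          # MW
sens = [[0.25, 0.75],
        [0.20, -0.10]]
shift = adjustment_amount(flows, limits, sens, up_unit=0, down_unit=1,
                          up_margin=100.0, down_margin=100.0)
new_flows = np.array(flows) + shift * (np.array(sens)[:, 0] - np.array(sens)[:, 1])
print(shift, new_flows)
```

When a margin caps the shift, one pair of units is not enough to clear the overload, and further units would have to be selected.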

Cite this article

SUN Lijun, GU Xueping, LIU Tong, et al. A deep reinforcement learning algorithm-based active safety correction method for power grids[J]. Power System Protection and Control, 2022, 50(10): 114-122.

History
  • Received: 2021-07-16
  • Last revised: 2021-09-02
  • Published online: 2022-05-24