Active and reactive power coordinated optimal voltage control of a distribution network based on counterfactual multi-agent reinforcement learning

doi:10.19783/j.cnki.pspc.231477

Power System Protection and Control, 2024, Vol. 52, No. 18, pp. 76-86

Authors: ZHANG Zixiao, CUI Mingjian, ZHANG Chengbin, et al.
Affiliations: 1. School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China; 2. School of Electrical and Automation Engineering, Hefei University of Technology, Hefei 230009, China; 3. State Grid Jiangxi Electric Power Research Institute, Nanchang 330096, China
Funding: National Natural Science Foundation of China (52207130); Key Research and Development Program of Jiangxi Province (20223BBE51013)


Abstract:

The large-scale integration of distributed generators has changed the structure and control of distribution networks. To address the voltage limit violations caused by the intermittency and fluctuation of distributed generation, this paper maintains distribution network voltage stability by adjusting the distribution of reactive and active power flows in the system. A coordinated voltage optimization method for distribution networks is proposed based on the counterfactual multi-agent policy gradients (COMA) algorithm. A counterfactual baseline resolves the "credit assignment" problem in multi-agent reinforcement learning, enabling joint optimal scheduling of active power output devices and reactive power compensation devices. Each agent selects its actions from local observations, which reduces the communication burden on the system, and the method does not depend on an accurate power flow model, enabling real-time optimal control of the distribution network. The feasibility and effectiveness of the proposed algorithm are verified on a modified IEEE 33-node system and a 141-node system. A comparison with classical algorithms further demonstrates the performance advantages of the proposed algorithm in voltage optimization control of distribution networks.
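The abstract does not spell out how the counterfactual baseline is computed. As a rough illustration only, the sketch below follows the standard COMA formulation, in which an agent's advantage is the critic's value of the joint action actually taken minus the policy-weighted average of the values obtained by swapping in each of that agent's alternative actions while the other agents' actions stay fixed. The function name, the three-action compensator, and all numbers are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def counterfactual_advantage(
    q_values: Sequence[float],
    policy: Sequence[float],
    taken_action: int,
) -> float:
    """COMA-style advantage for a single agent.

    q_values[k]: critic estimate of the joint action value when this agent
                 plays action k and all other agents keep their actual actions.
    policy[k]:   probability this agent assigns to action k given its
                 local observation.
    """
    if len(q_values) != len(policy):
        raise ValueError("q_values and policy must have the same length")
    if abs(sum(policy) - 1.0) > 1e-6:
        raise ValueError("policy must sum to 1")
    # Counterfactual baseline: marginalise out only this agent's action.
    baseline = sum(p * q for p, q in zip(policy, q_values))
    return q_values[taken_action] - baseline


# Hypothetical reactive power compensator with three discrete actions:
# 0 = absorb, 1 = hold, 2 = inject. Values are made up for illustration.
q_example = [-0.8, -0.5, -0.1]
pi_example = [0.2, 0.5, 0.3]
adv_example = counterfactual_advantage(q_example, pi_example, taken_action=2)
```

Because the baseline depends only on the other agents' actions and on this agent's policy, it isolates the agent's own contribution to the shared reward, and the advantage averaged over the agent's policy is zero.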


Cite this article:

ZHANG Zixiao, CUI Mingjian, ZHANG Chengbin, et al. Active and reactive power coordinated optimal voltage control of a distribution network based on counterfactual multi-agent reinforcement learning[J]. Power System Protection and Control, 2024, 52(18): 76-86.


History:

Received: 2023-12-28
Last revised: 2024-04-18
Published online: 2024-09-13
