不确定性环境下基于深度强化学习的综合能源系统动态调度
CSTR:
作者:
作者单位:

(1.北京交通大学电气工程学院,北京 100044;2.中国电力科学研究院有限公司,北京 100192)

作者简介:

蔺伟山(1997—),男,硕士研究生,研究方向为人工智能在综合能源系统中的应用;E-mail: 20121455@bjtu.edu.cn 王小君(1978—),男,通信作者,博士,教授,研究方向为电力系统分析与控制、综合能源系统优化运行;E-mail: xjwang1@bjtu.edu.cn 孙庆凯(1992—),男,博士研究生,研究方向为人工智能在综合能源系统中的应用。E-mail: sunqingkai_123@ 163.com

通讯作者:

中图分类号:

基金项目:

国家自然科学基金项目资助(51977005)


Dynamic dispatch of an integrated energy system based on deep reinforcement learning in an uncertain environment
Author:
Affiliation:

(1. School of Electrical Engineering, Beijing Jiaotong University, Beijing 100044, China; 2. China Electric Power Research Institute Co., Ltd., Beijing 100192, China)

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    随着综合能源系统中间歇性能源和负荷不确定性的逐步增强,传统的调度方法局限于固定物理模型及参数设定,难以较好地动态响应源荷的随机波动。针对这一问题,提出了一种基于深度强化学习的综合能源系统动态调度方法。首先,以数据驱动方式构建面向综合能源系统的深度强化学习模型,通过智能体与综合能源系统的持续交互,自适应学习调度策略,降低对物理模型的依赖程度。其次,通过添加随机扰动的方式表征源荷不确定性变化特征,针对不确定性变化特征改进深度强化学习模型的状态空间、动作空间、奖励机制以及训练流程等关键环节,并经由近端策略优化算法优化求解,实现了综合能源系统的动态调度决策。最后,通过算例仿真验证了所提方法在不同时间尺度以及不确定性环境下的可行性和有效性。

    Abstract:

    As the uncertainties of intermittent energy and load in the integrated energy system gradually increase, traditional dispatch methods are limited to fixed physical models and parameter settings that can hardly respond to the random fluctuations in the dynamic system with source-load. In this paper, a deep reinforcement learning-based dynamic dispatch method for the integrated energy system is proposed to address this problem. First, a data-driven deep reinforcement learning model is constructed for the integrated energy system. Through the continuous interaction between agent and integrated energy system, the dispatch strategies are learned adaptively to reduce dependence on the physical models. Secondly, the variations of source-load uncertainties are characterized by adding random disturbances. Pivotal aspects such as state spaces, action spaces, reward mechanisms and the training process of the deep reinforcement learning model are improved according to the characteristics of uncertainties. Then a proximal policy optimization algorithm is used to solve the problem, and the dynamic dispatch decisions of the integrated energy system are realized. Finally, simulation results verify the feasibility and effectiveness of the proposed method over different time scales and in uncertain environments. This work is supported by the National Natural Science Foundation of China (No. 51977005).

    参考文献
    相似文献
    引证文献
引用本文

蔺伟山,王小君,孙庆凯,等.不确定性环境下基于深度强化学习的综合能源系统动态调度[J].电力系统保护与控制,2022,50(18):50-60.[LIN Weishan, WANG Xiaojun, SUN Qingkai, et al. Dynamic dispatch of an integrated energy system based on deep reinforcement learning in an uncertain environment[J]. Power System Protection and Control,2022,V50(18):50-60]

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2021-12-10
  • 最后修改日期:2022-01-24
  • 录用日期:
  • 在线发布日期: 2022-09-15
  • 出版日期:
文章二维码