基于机器学习的配电网异常缺失数据动态清洗方法
CSTR:
作者:
作者单位:

1.湖南大学电气与信息工程学院,湖南 长沙 410082;2.国网温州供电公司,浙江 温州 325000; 3.国网湖南省电力有限公司电力科学研究院,湖南 长沙 410007

作者简介:

梅玉杰(1998—),男,硕士研究生,主要研究方向为配电网负荷预测、配电网重过载预警;E-mail:?15209809633@ 163.com 李 勇(1982—),男,通信作者,教授,博士生导师,主要研究方向为电力系统运行与控制、电力电子系统与控制等。E-mail:?yongli@hnu.edu.cn

通讯作者:

中图分类号:

基金项目:

国家自然科学基金联合基金重点支持项目资助(U22B200134);国家重点研发计划政府间国际科技创新合作重点项目资助(2022YFE0129300);国网湖南省电力有限公司科技项目资助(5216A521001F)


Dynamic data cleaning method of abnormal and missing data in a distribution network based on machine learning
Author:
Affiliation:

1. School of Electrical Engineering and Information, Hunan University, Changsha 410082, China; 2. State Grid Wenzhou Power Supply Co., Ltd., Wenzhou 325000, China; 3. State Grid Human Electric Power Co., Ltd. Research Institute, Changsha 410007, China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对传统配电网数据清洗过程中异常数据判断阈值需要人为设定、缺失数据填补效率不佳的局限性,提出基于机器学习的配电网异常缺失数据一体化动态清洗方法。首先,基于局部异常因子检测算法和高斯混合模型,提出一种异常数据动态检测改进算法,实现配电网异常数据阈值的准确自动选择。其次,基于随机森林算法与最小二乘回归法,提出一种配电网缺失数据动态填补算法。根据缺失数据时间长度自适应优化填补算法,在保证数据填补精度的同时降低计算时间。在此基础上,通过异常数据检测和缺失数据填补共同构建一体化动态清洗架构。采用湖南某地区配电网数据进行实例验证,结果表明所提方法可实现异常辨识阈值准确自动选择,有效检测配电网异常数据,并且实现缺失数据填补精度与速度的平衡,具有较好的工程应用价值。

    Abstract:

    There is a limitation of manual setting of an abnormal data judgment threshold and there will be inefficient filling of missing data in the traditional process of data cleaning in a distribution network. This paper proposes an integrated dynamic cleaning method for distribution network abnormal and missing data based on machine learning. First, based on a local outlier factor and Gaussian mixture model, an improved dynamic identification algorithm is proposed to realize the automatic selection of threshold of abnormal data. Second, based on the random forest algorithm and least squares regression method, a dynamic filling algorithm for missing data is proposed. Depending on the length of missing data, it adaptively optimizes the filling algorithm to ensure filling accuracy and reduce running time. An integrated dynamic cleaning architecture is built through abnormal data identification and missing data interpolation. The data of the distribution network in a certain area of Hunan are used for example verification. The results show that the proposed method can realize accurate and automatic abnormal data detection and achieve a balance between the filling accuracy and speed of missing data in a distribution network. This has good engineering application value.

    参考文献
    相似文献
    引证文献
引用本文

梅玉杰,李 勇,周王峰,等.基于机器学习的配电网异常缺失数据动态清洗方法[J].电力系统保护与控制,2023,51(7):158-169.[MEI Yujie, LI Yong, ZHOU Wangfeng, et al. Dynamic data cleaning method of abnormal and missing data in a distribution network based on machine learning[J]. Power System Protection and Control,2023,V51(7):158-169]

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2022-06-29
  • 最后修改日期:2022-09-14
  • 录用日期:
  • 在线发布日期: 2023-04-07
  • 出版日期:
文章二维码
关闭
关闭