Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest Classifier
Effective detection of electricity theft is essential to maintain power system reliability. With the development of smart grids, traditional electricity theft detection technologies have become ineffective to deal with the increasingly complex data on the users’ side. To improve the auditing efficie...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-04-01
|
Series: | Energies |
Subjects: | |
Online Access: | https://www.mdpi.com/1996-1073/13/8/2039 |
id |
doaj-9f4da83a799f4c50ade623515c61193a |
---|---|
record_format |
Article |
spelling |
doaj-9f4da83a799f4c50ade623515c61193a2020-11-25T02:02:14ZengMDPI AGEnergies1996-10732020-04-01132039203910.3390/en13082039Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest ClassifierZhengwei Qu0Hongwen Li1Yunjing Wang2Jiaxi Zhang3Ahmed Abu-Siada4Yunxiao Yao5Key Laboratory of Power Electronics for Energy Conservation and Drive Control, Yanshan University, Qinhuangdao 066004, ChinaKey Laboratory of Power Electronics for Energy Conservation and Drive Control, Yanshan University, Qinhuangdao 066004, ChinaKey Laboratory of Power Electronics for Energy Conservation and Drive Control, Yanshan University, Qinhuangdao 066004, ChinaKey Laboratory of Power Electronics for Energy Conservation and Drive Control, Yanshan University, Qinhuangdao 066004, ChinaSchool of Electrical Engineering Computing and Mathematical Sciences, Curtin University, Perth, WA 6102, AustraliaState Grid Hubei DC Operation and Maintenance Company, Yichang 443008, ChinaEffective detection of electricity theft is essential to maintain power system reliability. With the development of smart grids, traditional electricity theft detection technologies have become ineffective to deal with the increasingly complex data on the users’ side. To improve the auditing efficiency of grid enterprises, a new electricity theft detection method based on improved synthetic minority oversampling technique (SMOTE) and improve random forest (RF) method is proposed in this paper. The data of normal and electricity theft users were classified as positive data (PD) and negative data (ND), respectively. In practice, the number of ND was far less than PD, which made the dataset composed of these two types of data become unbalanced. An improved SOMTE based on K-means clustering algorithm (K-SMOTE) was firstly presented to balance the dataset. The cluster center of ND was determined by K-means method. Then, the ND were interpolated by SMOTE on the basis of the cluster center to balance the entire data. Finally, the RF classifier was trained with the balanced dataset, and the optimal number of decision trees in RF was decided according to the convergence of out-of-bag data error (OOB error). Electricity theft behaviors on the user side were detected by the trained RF classifier.https://www.mdpi.com/1996-1073/13/8/2039smart gridnontechnical losseselectricity theft detectionsynthetic minority oversampling techniqueK-means clusterrandom forest |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zhengwei Qu Hongwen Li Yunjing Wang Jiaxi Zhang Ahmed Abu-Siada Yunxiao Yao |
spellingShingle |
Zhengwei Qu Hongwen Li Yunjing Wang Jiaxi Zhang Ahmed Abu-Siada Yunxiao Yao Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest Classifier Energies smart grid nontechnical losses electricity theft detection synthetic minority oversampling technique K-means cluster random forest |
author_facet |
Zhengwei Qu Hongwen Li Yunjing Wang Jiaxi Zhang Ahmed Abu-Siada Yunxiao Yao |
author_sort |
Zhengwei Qu |
title |
Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest Classifier |
title_short |
Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest Classifier |
title_full |
Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest Classifier |
title_fullStr |
Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest Classifier |
title_full_unstemmed |
Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest Classifier |
title_sort |
detection of electricity theft behavior based on improved synthetic minority oversampling technique and random forest classifier |
publisher |
MDPI AG |
series |
Energies |
issn |
1996-1073 |
publishDate |
2020-04-01 |
description |
Effective detection of electricity theft is essential to maintain power system reliability. With the development of smart grids, traditional electricity theft detection technologies have become ineffective to deal with the increasingly complex data on the users’ side. To improve the auditing efficiency of grid enterprises, a new electricity theft detection method based on improved synthetic minority oversampling technique (SMOTE) and improve random forest (RF) method is proposed in this paper. The data of normal and electricity theft users were classified as positive data (PD) and negative data (ND), respectively. In practice, the number of ND was far less than PD, which made the dataset composed of these two types of data become unbalanced. An improved SOMTE based on K-means clustering algorithm (K-SMOTE) was firstly presented to balance the dataset. The cluster center of ND was determined by K-means method. Then, the ND were interpolated by SMOTE on the basis of the cluster center to balance the entire data. Finally, the RF classifier was trained with the balanced dataset, and the optimal number of decision trees in RF was decided according to the convergence of out-of-bag data error (OOB error). Electricity theft behaviors on the user side were detected by the trained RF classifier. |
topic |
smart grid nontechnical losses electricity theft detection synthetic minority oversampling technique K-means cluster random forest |
url |
https://www.mdpi.com/1996-1073/13/8/2039 |
work_keys_str_mv |
AT zhengweiqu detectionofelectricitytheftbehaviorbasedonimprovedsyntheticminorityoversamplingtechniqueandrandomforestclassifier AT hongwenli detectionofelectricitytheftbehaviorbasedonimprovedsyntheticminorityoversamplingtechniqueandrandomforestclassifier AT yunjingwang detectionofelectricitytheftbehaviorbasedonimprovedsyntheticminorityoversamplingtechniqueandrandomforestclassifier AT jiaxizhang detectionofelectricitytheftbehaviorbasedonimprovedsyntheticminorityoversamplingtechniqueandrandomforestclassifier AT ahmedabusiada detectionofelectricitytheftbehaviorbasedonimprovedsyntheticminorityoversamplingtechniqueandrandomforestclassifier AT yunxiaoyao detectionofelectricitytheftbehaviorbasedonimprovedsyntheticminorityoversamplingtechniqueandrandomforestclassifier |
_version_ |
1724954241423900672 |