Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest Classifier

Effective detection of electricity theft is essential to maintain power system reliability. With the development of smart grids, traditional electricity theft detection technologies have become ineffective to deal with the increasingly complex data on the users’ side. To improve the auditing efficie...

Full description

Bibliographic Details
Main Authors: Zhengwei Qu, Hongwen Li, Yunjing Wang, Jiaxi Zhang, Ahmed Abu-Siada, Yunxiao Yao
Format: Article
Language:English
Published: MDPI AG 2020-04-01
Series:Energies
Subjects:
Online Access:https://www.mdpi.com/1996-1073/13/8/2039
id doaj-9f4da83a799f4c50ade623515c61193a
record_format Article
spelling doaj-9f4da83a799f4c50ade623515c61193a2020-11-25T02:02:14ZengMDPI AGEnergies1996-10732020-04-01132039203910.3390/en13082039Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest ClassifierZhengwei Qu0Hongwen Li1Yunjing Wang2Jiaxi Zhang3Ahmed Abu-Siada4Yunxiao Yao5Key Laboratory of Power Electronics for Energy Conservation and Drive Control, Yanshan University, Qinhuangdao 066004, ChinaKey Laboratory of Power Electronics for Energy Conservation and Drive Control, Yanshan University, Qinhuangdao 066004, ChinaKey Laboratory of Power Electronics for Energy Conservation and Drive Control, Yanshan University, Qinhuangdao 066004, ChinaKey Laboratory of Power Electronics for Energy Conservation and Drive Control, Yanshan University, Qinhuangdao 066004, ChinaSchool of Electrical Engineering Computing and Mathematical Sciences, Curtin University, Perth, WA 6102, AustraliaState Grid Hubei DC Operation and Maintenance Company, Yichang 443008, ChinaEffective detection of electricity theft is essential to maintain power system reliability. With the development of smart grids, traditional electricity theft detection technologies have become ineffective to deal with the increasingly complex data on the users’ side. To improve the auditing efficiency of grid enterprises, a new electricity theft detection method based on improved synthetic minority oversampling technique (SMOTE) and improve random forest (RF) method is proposed in this paper. The data of normal and electricity theft users were classified as positive data (PD) and negative data (ND), respectively. In practice, the number of ND was far less than PD, which made the dataset composed of these two types of data become unbalanced. An improved SOMTE based on K-means clustering algorithm (K-SMOTE) was firstly presented to balance the dataset. The cluster center of ND was determined by K-means method. Then, the ND were interpolated by SMOTE on the basis of the cluster center to balance the entire data. Finally, the RF classifier was trained with the balanced dataset, and the optimal number of decision trees in RF was decided according to the convergence of out-of-bag data error (OOB error). Electricity theft behaviors on the user side were detected by the trained RF classifier.https://www.mdpi.com/1996-1073/13/8/2039smart gridnontechnical losseselectricity theft detectionsynthetic minority oversampling techniqueK-means clusterrandom forest
collection DOAJ
language English
format Article
sources DOAJ
author Zhengwei Qu
Hongwen Li
Yunjing Wang
Jiaxi Zhang
Ahmed Abu-Siada
Yunxiao Yao
spellingShingle Zhengwei Qu
Hongwen Li
Yunjing Wang
Jiaxi Zhang
Ahmed Abu-Siada
Yunxiao Yao
Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest Classifier
Energies
smart grid
nontechnical losses
electricity theft detection
synthetic minority oversampling technique
K-means cluster
random forest
author_facet Zhengwei Qu
Hongwen Li
Yunjing Wang
Jiaxi Zhang
Ahmed Abu-Siada
Yunxiao Yao
author_sort Zhengwei Qu
title Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest Classifier
title_short Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest Classifier
title_full Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest Classifier
title_fullStr Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest Classifier
title_full_unstemmed Detection of Electricity Theft Behavior Based on Improved Synthetic Minority Oversampling Technique and Random Forest Classifier
title_sort detection of electricity theft behavior based on improved synthetic minority oversampling technique and random forest classifier
publisher MDPI AG
series Energies
issn 1996-1073
publishDate 2020-04-01
description Effective detection of electricity theft is essential to maintain power system reliability. With the development of smart grids, traditional electricity theft detection technologies have become ineffective to deal with the increasingly complex data on the users’ side. To improve the auditing efficiency of grid enterprises, a new electricity theft detection method based on improved synthetic minority oversampling technique (SMOTE) and improve random forest (RF) method is proposed in this paper. The data of normal and electricity theft users were classified as positive data (PD) and negative data (ND), respectively. In practice, the number of ND was far less than PD, which made the dataset composed of these two types of data become unbalanced. An improved SOMTE based on K-means clustering algorithm (K-SMOTE) was firstly presented to balance the dataset. The cluster center of ND was determined by K-means method. Then, the ND were interpolated by SMOTE on the basis of the cluster center to balance the entire data. Finally, the RF classifier was trained with the balanced dataset, and the optimal number of decision trees in RF was decided according to the convergence of out-of-bag data error (OOB error). Electricity theft behaviors on the user side were detected by the trained RF classifier.
topic smart grid
nontechnical losses
electricity theft detection
synthetic minority oversampling technique
K-means cluster
random forest
url https://www.mdpi.com/1996-1073/13/8/2039
work_keys_str_mv AT zhengweiqu detectionofelectricitytheftbehaviorbasedonimprovedsyntheticminorityoversamplingtechniqueandrandomforestclassifier
AT hongwenli detectionofelectricitytheftbehaviorbasedonimprovedsyntheticminorityoversamplingtechniqueandrandomforestclassifier
AT yunjingwang detectionofelectricitytheftbehaviorbasedonimprovedsyntheticminorityoversamplingtechniqueandrandomforestclassifier
AT jiaxizhang detectionofelectricitytheftbehaviorbasedonimprovedsyntheticminorityoversamplingtechniqueandrandomforestclassifier
AT ahmedabusiada detectionofelectricitytheftbehaviorbasedonimprovedsyntheticminorityoversamplingtechniqueandrandomforestclassifier
AT yunxiaoyao detectionofelectricitytheftbehaviorbasedonimprovedsyntheticminorityoversamplingtechniqueandrandomforestclassifier
_version_ 1724954241423900672