Using machine learning ensemble method for detection of energy theft in smart meters
Kawoosa, Asif Iqbal; Prashar, Deepak; Faheem, Muhammad; Jha, Nishant; Khan, Arfat Ahmad (2023-09-28)
Kawoosa, Asif Iqbal
Prashar, Deepak
Faheem, Muhammad
Jha, Nishant
Khan, Arfat Ahmad
The Institution of Engineering and Technology
28.09.2023
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi-fe20231002138049
https://urn.fi/URN:NBN:fi-fe20231002138049
Kuvaus
vertaisarvioitu
© 2023 The Authors. IET Generation, Transmission & Distribution published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
© 2023 The Authors. IET Generation, Transmission & Distribution published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
Tiivistelmä
Electricity theft is a primary concern for utility providers, as it leads to substantial financial losses. To address the issue, a novel extreme gradient boosting (XGBoost)-based model utilizing the consumers’ electricity consumption patterns for analysis is proposed for electricity theft detection (ETD). To remove the imbalance in the real-world electricity consumption dataset and ensure an even distribution of theft and non-theft data instances, six different artificially created theft attacks were used. Moreover, the utilization of the XGBoost algorithm for classification, especially to identify malicious instances of electricity theft, yielded commendable accuracy rates and a minimal occurrence of false positives. The proposed model identifies electricity theft specific to the regions, utilizing electricity consumption parameters, and other variables as input features. The authors’ model outperformed existing benchmarks like k-neural networks, light gradient boost, random forest, support vector machine, decision tree, and AdaBoost. The simulation results using the false attacks for balancing the dataset have improved the proposed model's performance, achieving a precision, recall, and F1-score of 96%, 95%, and 95%, respectively. The results of the detection rate and the false positive rate (FPR) of the proposed XGBoost-based detection model have achieved 96% and 3%, respectively.
Kokoelmat
- Artikkelit [2910]