Simple Data Augmentation Tricks for Boosting Performance on Electricity Theft Detection Tasks

Wenlong Liao, Zhe Yang, Birgitte Bak-Jensen, Jayakrishnan Radhakrishna Pillai, Leandro Von Krannichfeldt, Yusen Wang, Dechang Yang*

*Kontaktforfatter

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

4 Citationer (Scopus)

Abstract

In practical engineering, electricity theft detection is usually performed on highly imbalanced datasets (i.e., the number of fraudulent samples is much smaller than the benign ones), which limits the accuracy of the classifier. To alleviate the data imbalance problem, this article proposes simple data augmentation tricks (SDAT) to boost performance on electricity theft detection tasks. SDAT includes five simple but powerful operations: adding noises to electricity consumption readings, drifting values of electricity consumption readings, quantizing electricity consumption readings to a level set, adding a fixed value to electricity consumption readings, and adding changeable values to electricity consumption readings. In addition, eight potential tricks are also mentioned. Numerical simulations are conducted on a real-world dataset. The simulation results show that SDAT can significantly boost the performance of different classifiers, especially for small datasets. Besides, specific suggestions on how to select parameters of SDAT are provided for its migration use to other datasets.

OriginalsprogEngelsk
TidsskriftI E E E Transactions on Industry Applications
Vol/bind59
Udgave nummer4
Sider (fra-til)4846-4858
Antal sider13
ISSN0093-9994
DOI
StatusUdgivet - 1 jul. 2023

Fingeraftryk

Dyk ned i forskningsemnerne om 'Simple Data Augmentation Tricks for Boosting Performance on Electricity Theft Detection Tasks'. Sammen danner de et unikt fingeraftryk.

Citationsformater