Int J Performability Eng ›› 2019, Vol. 15 ›› Issue (9): 2453-2461.doi: 10.23940/ijpe.19.09.p19.24532461

Previous Articles     Next Articles

Application of Improved Feature Pre-processing Method in Prevention and Control of Electricity Charge Risk

Huaiguang Wua, Yongsheng Shia,*, Shenyi Qiana, Hongwei Taoa, and Jiangtao Maa,b   

  1. aZhengzhou University of Light Industry, Zhengzhou, 450001, China;
    bNational Digital Switching System Engineering and Technological R&D Center, Zhengzhou, 450001, China
  • Submitted on ; Revised on ; Accepted on
  • Contact: *.E-mail address: 1791526258@qq.com

Abstract: With the increase in power data, past data processing technologies cannot meet the needs of rapid processing and intelligent analysis of massive data. In order to solve the problems of current performance bottleneck in the calculation of power big data and recover users' electricity tariff arrears in a timely manner, we use the technology of combining parallel computing with feature expansion to establish a feature expansion method based on Spark distributed computing (FESDC). According to the data generated by the proposed method, we use a parallel logistic regression model to predict the arrears probability of future electricity users, which can effectively prevent and resolve the risk of electricity tariff recovery (ETR). Compared with the data processing method for single process (DPSP), the proposed method not only increased the accuracy of prediction, but also improved the performance of processing data.

Key words: Power big data, electricity tariff recovery, Spark, feature processing