Abstract
This paper proposes a process for the classification of new residential electricity customers. The current state of the art is extended by using a combination of smart metering and survey data and by using model-based feature selection for the classification task. Firstly, the normalized representative consumption profiles of the population are derived through the clustering of data from households. Secondly, new customers are classified using survey data and a limited amount of smart metering data. Thirdly, regression analysis and model-based feature selection results explain the importance of the variables and which are the drivers of different consumption profiles, enabling the extraction of appropriate models. The results of a case study show that the use of survey data significantly increases accuracy of the classification task (up to 20%). Considering four consumption groups, more than half of the customers are correctly classified with only one week of metering data, with more weeks the accuracy is significantly improved. The use of model-based feature selection resulted in the use of a significantly lower number of features allowing an easy interpretation of the derived models.
Code available in: Visualization