Evaluation of clustering techniques for generating household energy consumption patterns in a developing country

This work compares and evaluates clustering techniques for generating representative daily load profiles that are characteristic of residential energy consumers in South Africa. The input data captures two decades of metered household consumption, covering 14 945 household years and 3 295 848 daily...

Full description

Bibliographic Details
Main Author: Toussaint, Wiebke
Other Authors: Moodley, Deshen
Format: Dissertation
Language:English
Published: Faculty of Science 2020
Subjects:
Online Access:http://hdl.handle.net/11427/30905
id ndltd-netd.ac.za-oai-union.ndltd.org-uct-oai-localhost-11427-30905
record_format oai_dc
spelling ndltd-netd.ac.za-oai-union.ndltd.org-uct-oai-localhost-11427-309052020-10-06T05:11:13Z Evaluation of clustering techniques for generating household energy consumption patterns in a developing country Toussaint, Wiebke Moodley, Deshen Meyer, Thomas Computer Science This work compares and evaluates clustering techniques for generating representative daily load profiles that are characteristic of residential energy consumers in South Africa. The input data captures two decades of metered household consumption, covering 14 945 household years and 3 295 848 daily load patterns of a population with high variability across temporal, geographic, social and economic dimensions. Different algorithms, normalisation and pre-binning techniques are evaluated to determine the best clustering structure. The study shows that normalisation is essential for producing good clusters. Specifically, unit norm produces more usable and more expressive clusters than the zero-one scaler, which is the most common method of normalisation used in the domain. While pre-binning improves clustering results for the dataset, the choice of pre-binning method does not significantly impact the quality of clusters produced. Data representation and especially the inclusion or removal of zero-valued profiles is an important consideration in relation to the pre-binning approach selected. Like several previous studies, the k-means algorithm produces the best results. Introducing a qualitative evaluation framework facilitated the evaluation process and helped identify a top clustering structure that is significantly more useable than those that would have been selected based on quantitative metrics alone. The approach demonstrates how explicitly defined qualitative evaluation measures can aid in selecting a clustering structure that is more likely to have real world application. To our knowledge this is the first work that uses cluster analysis to generate customer archetypes from representative daily load profiles in a highly variable, developing country context 2020-02-07T09:36:55Z 2020-02-07T09:36:55Z 2019 2020-01-24T09:35:12Z Master Thesis Masters MSc http://hdl.handle.net/11427/30905 eng application/pdf Faculty of Science Department of Computer Science
collection NDLTD
language English
format Dissertation
sources NDLTD
topic Computer Science
spellingShingle Computer Science
Toussaint, Wiebke
Evaluation of clustering techniques for generating household energy consumption patterns in a developing country
description This work compares and evaluates clustering techniques for generating representative daily load profiles that are characteristic of residential energy consumers in South Africa. The input data captures two decades of metered household consumption, covering 14 945 household years and 3 295 848 daily load patterns of a population with high variability across temporal, geographic, social and economic dimensions. Different algorithms, normalisation and pre-binning techniques are evaluated to determine the best clustering structure. The study shows that normalisation is essential for producing good clusters. Specifically, unit norm produces more usable and more expressive clusters than the zero-one scaler, which is the most common method of normalisation used in the domain. While pre-binning improves clustering results for the dataset, the choice of pre-binning method does not significantly impact the quality of clusters produced. Data representation and especially the inclusion or removal of zero-valued profiles is an important consideration in relation to the pre-binning approach selected. Like several previous studies, the k-means algorithm produces the best results. Introducing a qualitative evaluation framework facilitated the evaluation process and helped identify a top clustering structure that is significantly more useable than those that would have been selected based on quantitative metrics alone. The approach demonstrates how explicitly defined qualitative evaluation measures can aid in selecting a clustering structure that is more likely to have real world application. To our knowledge this is the first work that uses cluster analysis to generate customer archetypes from representative daily load profiles in a highly variable, developing country context
author2 Moodley, Deshen
author_facet Moodley, Deshen
Toussaint, Wiebke
author Toussaint, Wiebke
author_sort Toussaint, Wiebke
title Evaluation of clustering techniques for generating household energy consumption patterns in a developing country
title_short Evaluation of clustering techniques for generating household energy consumption patterns in a developing country
title_full Evaluation of clustering techniques for generating household energy consumption patterns in a developing country
title_fullStr Evaluation of clustering techniques for generating household energy consumption patterns in a developing country
title_full_unstemmed Evaluation of clustering techniques for generating household energy consumption patterns in a developing country
title_sort evaluation of clustering techniques for generating household energy consumption patterns in a developing country
publisher Faculty of Science
publishDate 2020
url http://hdl.handle.net/11427/30905
work_keys_str_mv AT toussaintwiebke evaluationofclusteringtechniquesforgeneratinghouseholdenergyconsumptionpatternsinadevelopingcountry
_version_ 1719348770912600064