Waveform clustering - Grouping similar power system events

Over the last decade, data has become a highly valuable resource. Electrical power grids deal with large quantities of data, and continuously collect this for analytical purposes. Anomalies that occur within this data is important to identify since they could cause nonoptimal performance within the...

Full description

Bibliographic Details
Main Authors: Eriksson, Therése, Mahmoud Abdelnaeim, Mohamed
Format: Others
Language:English
Published: Mälardalens högskola, Akademin för innovation, design och teknik 2019
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-44147
id ndltd-UPSALLA1-oai-DiVA.org-mdh-44147
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-mdh-441472019-09-19T04:22:50ZWaveform clustering - Grouping similar power system eventsengEriksson, TheréseMahmoud Abdelnaeim, MohamedMälardalens högskola, Akademin för innovation, design och teknikMälardalens högskola, Akademin för innovation, design och teknik2019Unsupervised learningclusteringmultivariate time-series datadiscrete wavelet transformpower transformerComputer Vision and Robotics (Autonomous Systems)Datorseende och robotik (autonoma system)Over the last decade, data has become a highly valuable resource. Electrical power grids deal with large quantities of data, and continuously collect this for analytical purposes. Anomalies that occur within this data is important to identify since they could cause nonoptimal performance within the substations, or in worse cases damage to the substations themselves. However, large datasets in the order of millions are hard or even impossible to gain a reasonable overview of the data manually. When collecting data from electrical power grids, predefined triggering criteria are often used to indicate that an event has occurred within the specific system. This makes it difficult to search for events that are unknown to the operator of the deployed acquisition system. Clustering, an unsupervised machine learning method, can be utilised for fault prediction within systems generating large amounts of multivariate time-series data without labels and can group data more efficiently and without the bias of a human operator. A large number of clustering techniques exist, as well as methods for extracting information from the data itself, and identification of these was of utmost importance. This thesis work presents a study of the methods involved in the creation of such a clustering system which is suitable for the specific type of data. The objective of the study was to identify methods that enables finding the underlying structures of the data and cluster the data based on these. The signals were split into multiple frequency sub-bands and from these features could be extracted and evaluated. Using suitable combinations of features the data was clustered with two different clustering algorithms, CLARA and CLARANS, and evaluated with established quality analysis methods. The results indicate that CLARA performed overall best on all the tested feature sets. The formed clusters hold valuable information such as indications of unknown events within the system, and if similar events are clustered together this can assist a human operator further to investigate the importance of the clusters themselves. A further conclusion from the results is that research into the use of more optimised clustering algorithms is necessary so that expansion into larger datasets can be considered. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-44147application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Unsupervised learning
clustering
multivariate time-series data
discrete wavelet transform
power transformer
Computer Vision and Robotics (Autonomous Systems)
Datorseende och robotik (autonoma system)
spellingShingle Unsupervised learning
clustering
multivariate time-series data
discrete wavelet transform
power transformer
Computer Vision and Robotics (Autonomous Systems)
Datorseende och robotik (autonoma system)
Eriksson, Therése
Mahmoud Abdelnaeim, Mohamed
Waveform clustering - Grouping similar power system events
description Over the last decade, data has become a highly valuable resource. Electrical power grids deal with large quantities of data, and continuously collect this for analytical purposes. Anomalies that occur within this data is important to identify since they could cause nonoptimal performance within the substations, or in worse cases damage to the substations themselves. However, large datasets in the order of millions are hard or even impossible to gain a reasonable overview of the data manually. When collecting data from electrical power grids, predefined triggering criteria are often used to indicate that an event has occurred within the specific system. This makes it difficult to search for events that are unknown to the operator of the deployed acquisition system. Clustering, an unsupervised machine learning method, can be utilised for fault prediction within systems generating large amounts of multivariate time-series data without labels and can group data more efficiently and without the bias of a human operator. A large number of clustering techniques exist, as well as methods for extracting information from the data itself, and identification of these was of utmost importance. This thesis work presents a study of the methods involved in the creation of such a clustering system which is suitable for the specific type of data. The objective of the study was to identify methods that enables finding the underlying structures of the data and cluster the data based on these. The signals were split into multiple frequency sub-bands and from these features could be extracted and evaluated. Using suitable combinations of features the data was clustered with two different clustering algorithms, CLARA and CLARANS, and evaluated with established quality analysis methods. The results indicate that CLARA performed overall best on all the tested feature sets. The formed clusters hold valuable information such as indications of unknown events within the system, and if similar events are clustered together this can assist a human operator further to investigate the importance of the clusters themselves. A further conclusion from the results is that research into the use of more optimised clustering algorithms is necessary so that expansion into larger datasets can be considered.
author Eriksson, Therése
Mahmoud Abdelnaeim, Mohamed
author_facet Eriksson, Therése
Mahmoud Abdelnaeim, Mohamed
author_sort Eriksson, Therése
title Waveform clustering - Grouping similar power system events
title_short Waveform clustering - Grouping similar power system events
title_full Waveform clustering - Grouping similar power system events
title_fullStr Waveform clustering - Grouping similar power system events
title_full_unstemmed Waveform clustering - Grouping similar power system events
title_sort waveform clustering - grouping similar power system events
publisher Mälardalens högskola, Akademin för innovation, design och teknik
publishDate 2019
url http://urn.kb.se/resolve?urn=urn:nbn:se:mdh:diva-44147
work_keys_str_mv AT erikssontherese waveformclusteringgroupingsimilarpowersystemevents
AT mahmoudabdelnaeimmohamed waveformclusteringgroupingsimilarpowersystemevents
_version_ 1719252838728597504