Complementing Privacy and Utility Trade-Off with <i>Self-Organising Maps</i>

In recent years, data-enabled technologies have intensified the rate and scale at which organisations collect and analyse data. Data mining techniques are applied to realise the full potential of large-scale data analysis. These techniques are highly efficient in sifting through big data to extract...

Full description

Bibliographic Details
Main Authors: Kabiru Mohammed, Aladdin Ayesh, Eerke Boiten
Format: Article
Language:English
Published: MDPI AG 2021-08-01
Series:Cryptography
Subjects:
Online Access:https://www.mdpi.com/2410-387X/5/3/20
id doaj-e0924dba631b4a83ab6c237f6bc1f45a
record_format Article
spelling doaj-e0924dba631b4a83ab6c237f6bc1f45a2021-09-25T23:57:16ZengMDPI AGCryptography2410-387X2021-08-015202010.3390/cryptography5030020Complementing Privacy and Utility Trade-Off with <i>Self-Organising Maps</i>Kabiru Mohammed0Aladdin Ayesh1Eerke Boiten2Cyber Technology Institute, De Montfort University, The Gateway, Leicester LE1 9BH, UKCyber Technology Institute, De Montfort University, The Gateway, Leicester LE1 9BH, UKCyber Technology Institute, De Montfort University, The Gateway, Leicester LE1 9BH, UKIn recent years, data-enabled technologies have intensified the rate and scale at which organisations collect and analyse data. Data mining techniques are applied to realise the full potential of large-scale data analysis. These techniques are highly efficient in sifting through big data to extract hidden knowledge and assist evidence-based decisions, offering significant benefits to their adopters. However, this capability is constrained by important legal, ethical and reputational concerns. These concerns arise because they can be exploited to allow inferences to be made on sensitive data, thus posing severe threats to individuals’ privacy. Studies have shown Privacy-Preserving Data Mining (PPDM) can adequately address this privacy risk and permit knowledge extraction in mining processes. Several published works in this area have utilised clustering techniques to enforce anonymisation models on private data, which work by grouping the data into clusters using a quality measure and generalising the data in each group separately to achieve an anonymisation threshold. However, existing approaches do not work well with high-dimensional data, since it is difficult to develop good groupings without incurring excessive information loss. Our work aims to complement this balancing act by optimising utility in PPDM processes. To illustrate this, we propose a hybrid approach, that combines self-organising maps with conventional privacy-based clustering algorithms. We demonstrate through experimental evaluation, that results from our approach produce more utility for data mining tasks and outperforms conventional privacy-based clustering algorithms. This approach can significantly enable large-scale analysis of data in a privacy-preserving and trustworthy manner.https://www.mdpi.com/2410-387X/5/3/20<i>k</i>-anonymityclusteringself-organising mapprivacy preserving data mining
collection DOAJ
language English
format Article
sources DOAJ
author Kabiru Mohammed
Aladdin Ayesh
Eerke Boiten
spellingShingle Kabiru Mohammed
Aladdin Ayesh
Eerke Boiten
Complementing Privacy and Utility Trade-Off with <i>Self-Organising Maps</i>
Cryptography
<i>k</i>-anonymity
clustering
self-organising map
privacy preserving data mining
author_facet Kabiru Mohammed
Aladdin Ayesh
Eerke Boiten
author_sort Kabiru Mohammed
title Complementing Privacy and Utility Trade-Off with <i>Self-Organising Maps</i>
title_short Complementing Privacy and Utility Trade-Off with <i>Self-Organising Maps</i>
title_full Complementing Privacy and Utility Trade-Off with <i>Self-Organising Maps</i>
title_fullStr Complementing Privacy and Utility Trade-Off with <i>Self-Organising Maps</i>
title_full_unstemmed Complementing Privacy and Utility Trade-Off with <i>Self-Organising Maps</i>
title_sort complementing privacy and utility trade-off with <i>self-organising maps</i>
publisher MDPI AG
series Cryptography
issn 2410-387X
publishDate 2021-08-01
description In recent years, data-enabled technologies have intensified the rate and scale at which organisations collect and analyse data. Data mining techniques are applied to realise the full potential of large-scale data analysis. These techniques are highly efficient in sifting through big data to extract hidden knowledge and assist evidence-based decisions, offering significant benefits to their adopters. However, this capability is constrained by important legal, ethical and reputational concerns. These concerns arise because they can be exploited to allow inferences to be made on sensitive data, thus posing severe threats to individuals’ privacy. Studies have shown Privacy-Preserving Data Mining (PPDM) can adequately address this privacy risk and permit knowledge extraction in mining processes. Several published works in this area have utilised clustering techniques to enforce anonymisation models on private data, which work by grouping the data into clusters using a quality measure and generalising the data in each group separately to achieve an anonymisation threshold. However, existing approaches do not work well with high-dimensional data, since it is difficult to develop good groupings without incurring excessive information loss. Our work aims to complement this balancing act by optimising utility in PPDM processes. To illustrate this, we propose a hybrid approach, that combines self-organising maps with conventional privacy-based clustering algorithms. We demonstrate through experimental evaluation, that results from our approach produce more utility for data mining tasks and outperforms conventional privacy-based clustering algorithms. This approach can significantly enable large-scale analysis of data in a privacy-preserving and trustworthy manner.
topic <i>k</i>-anonymity
clustering
self-organising map
privacy preserving data mining
url https://www.mdpi.com/2410-387X/5/3/20
work_keys_str_mv AT kabirumohammed complementingprivacyandutilitytradeoffwithiselforganisingmapsi
AT aladdinayesh complementingprivacyandutilitytradeoffwithiselforganisingmapsi
AT eerkeboiten complementingprivacyandutilitytradeoffwithiselforganisingmapsi
_version_ 1717367530945773568