Computational efficiency of k-anonymization incorporating clustering
A research report submitted in partial fulfillment of the requirements for the degree of Master of Science in the field of e-Science in the School of Computer Science and Applied Mathematics, University of the Witwatersrand, Johannesburg, 2020 === Data publicizing pose a threat of disclosing data su...
Main Authors: | , |
---|---|
Format: | Others |
Language: | en |
Published: |
2020
|
Online Access: | https://hdl.handle.net/10539/30171 |
id |
ndltd-netd.ac.za-oai-union.ndltd.org-wits-oai-wiredspace.wits.ac.za-10539-30171 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-netd.ac.za-oai-union.ndltd.org-wits-oai-wiredspace.wits.ac.za-10539-301712021-05-24T05:08:12Z Computational efficiency of k-anonymization incorporating clustering Netshiunda, Fhulufhelo Emmanuel Emmanuel, Netshiunda Fhulufhelo A research report submitted in partial fulfillment of the requirements for the degree of Master of Science in the field of e-Science in the School of Computer Science and Applied Mathematics, University of the Witwatersrand, Johannesburg, 2020 Data publicizing pose a threat of disclosing data subjects associating them to their personal sensitive information. k-anonymization is a practical method used to anonymize datasets to be made publicly available. The k-anonymization hides identities of data subjects by ensuring that every record of a publicized dataset has at least k �� 1 (k being a natural number) other records similar to it with respect to a set of attributes called quasi-identifiers. To minimize information loss, a clustering technique is often used to group similar records before k-anonymization is applied. Processing both the clustering and the k-anonymization using current algorithms is computationally expensive. It is within this framework that this research focuses on parallel implementation of the k-anonymization algorithm which incorporates clustering to achieve time effective computations CK2020 2020-11-16T07:40:54Z 2020-11-16T07:40:54Z 2020 Thesis https://hdl.handle.net/10539/30171 en application/pdf |
collection |
NDLTD |
language |
en |
format |
Others
|
sources |
NDLTD |
description |
A research report submitted in partial fulfillment of the requirements for the degree of Master of Science in the field of e-Science in the School of Computer Science and Applied Mathematics, University of the Witwatersrand, Johannesburg, 2020 === Data publicizing pose a threat of disclosing data subjects associating
them to their personal sensitive information. k-anonymization is a practical
method used to anonymize datasets to be made publicly available.
The k-anonymization hides identities of data subjects by ensuring that
every record of a publicized dataset has at least k �� 1 (k being a natural
number) other records similar to it with respect to a set of attributes
called quasi-identifiers. To minimize information loss, a clustering technique
is often used to group similar records before k-anonymization is
applied. Processing both the clustering and the k-anonymization using
current algorithms is computationally expensive. It is within this
framework that this research focuses on parallel implementation of the
k-anonymization algorithm which incorporates clustering to achieve time
effective computations === CK2020 |
author |
Netshiunda, Fhulufhelo Emmanuel Emmanuel, Netshiunda Fhulufhelo |
spellingShingle |
Netshiunda, Fhulufhelo Emmanuel Emmanuel, Netshiunda Fhulufhelo Computational efficiency of k-anonymization incorporating clustering |
author_facet |
Netshiunda, Fhulufhelo Emmanuel Emmanuel, Netshiunda Fhulufhelo |
author_sort |
Netshiunda, Fhulufhelo Emmanuel |
title |
Computational efficiency of k-anonymization incorporating clustering |
title_short |
Computational efficiency of k-anonymization incorporating clustering |
title_full |
Computational efficiency of k-anonymization incorporating clustering |
title_fullStr |
Computational efficiency of k-anonymization incorporating clustering |
title_full_unstemmed |
Computational efficiency of k-anonymization incorporating clustering |
title_sort |
computational efficiency of k-anonymization incorporating clustering |
publishDate |
2020 |
url |
https://hdl.handle.net/10539/30171 |
work_keys_str_mv |
AT netshiundafhulufheloemmanuel computationalefficiencyofkanonymizationincorporatingclustering AT emmanuelnetshiundafhulufhelo computationalefficiencyofkanonymizationincorporatingclustering |
_version_ |
1719405785115525120 |