Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published Datasets

Although most conventional methods of preserving data privacy focus on static datasets, which remain unchanged after processing, real-world datasets may be dynamically modified often. Therefore, privacy-preservation methods must maintain data privacy after dataset modification. Re-anonymization of e...

Full description

Bibliographic Details
Main Authors: Odsuren Temuujin, Jinhyun Ahn, Dong-Hyuk Im
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8805309/
id doaj-9b6a433496844065b0192dd41a19fb70
record_format Article
spelling doaj-9b6a433496844065b0192dd41a19fb702021-03-29T23:14:23ZengIEEEIEEE Access2169-35362019-01-01712287812288810.1109/ACCESS.2019.29363018805309Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published DatasetsOdsuren Temuujin0Jinhyun Ahn1https://orcid.org/0000-0002-2331-004XDong-Hyuk Im2https://orcid.org/0000-0002-0290-755XDepartment of Computer and Information Engineering, Hoseo University, Asan, South KoreaDepartment of Management Information Systems, Jeju National University, Jeju, South KoreaDepartment of Computer and Information Engineering, Hoseo University, Asan, South KoreaAlthough most conventional methods of preserving data privacy focus on static datasets, which remain unchanged after processing, real-world datasets may be dynamically modified often. Therefore, privacy-preservation methods must maintain data privacy after dataset modification. Re-anonymization of entire datasets is inefficient when large datasets are frequently modified. Although several previous studies have addressed data privacy for incremental data updates (i.e., record insertions), they have not adequately it for dynamic changes made to existing datasets (i.e., record updates and deletions). Therefore, we identified limitations of data-privacy preservation for dynamically evolving datasets and used anatomy instead of generalization and suppression to develop a more efficient l-diversity algorithm for preserving privacy of such datasets. We also used a Cuckoo filter, a new probabilistic data structure for approximate set-membership tests, to improve data-processing efficiency. Experimental results demonstrated that our proposed data-anonymization algorithm processed data more efficiently than other conventional algorithms, requiring much less running time than conventional re-anonymization of entire datasets. The Cuckoo-filtered algorithm was especially efficient, dramatically reducing operation execution times while maintaining privacy of dynamically evolving datasets.https://ieeexplore.ieee.org/document/8805309/AnonymizationCuckoo filterdynamic data publishingl-diversityprivacy-preservation
collection DOAJ
language English
format Article
sources DOAJ
author Odsuren Temuujin
Jinhyun Ahn
Dong-Hyuk Im
spellingShingle Odsuren Temuujin
Jinhyun Ahn
Dong-Hyuk Im
Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published Datasets
IEEE Access
Anonymization
Cuckoo filter
dynamic data publishing
l-diversity
privacy-preservation
author_facet Odsuren Temuujin
Jinhyun Ahn
Dong-Hyuk Im
author_sort Odsuren Temuujin
title Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published Datasets
title_short Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published Datasets
title_full Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published Datasets
title_fullStr Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published Datasets
title_full_unstemmed Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published Datasets
title_sort efficient l-diversity algorithm for preserving privacy of dynamically published datasets
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Although most conventional methods of preserving data privacy focus on static datasets, which remain unchanged after processing, real-world datasets may be dynamically modified often. Therefore, privacy-preservation methods must maintain data privacy after dataset modification. Re-anonymization of entire datasets is inefficient when large datasets are frequently modified. Although several previous studies have addressed data privacy for incremental data updates (i.e., record insertions), they have not adequately it for dynamic changes made to existing datasets (i.e., record updates and deletions). Therefore, we identified limitations of data-privacy preservation for dynamically evolving datasets and used anatomy instead of generalization and suppression to develop a more efficient l-diversity algorithm for preserving privacy of such datasets. We also used a Cuckoo filter, a new probabilistic data structure for approximate set-membership tests, to improve data-processing efficiency. Experimental results demonstrated that our proposed data-anonymization algorithm processed data more efficiently than other conventional algorithms, requiring much less running time than conventional re-anonymization of entire datasets. The Cuckoo-filtered algorithm was especially efficient, dramatically reducing operation execution times while maintaining privacy of dynamically evolving datasets.
topic Anonymization
Cuckoo filter
dynamic data publishing
l-diversity
privacy-preservation
url https://ieeexplore.ieee.org/document/8805309/
work_keys_str_mv AT odsurentemuujin efficientldiversityalgorithmforpreservingprivacyofdynamicallypublisheddatasets
AT jinhyunahn efficientldiversityalgorithmforpreservingprivacyofdynamicallypublisheddatasets
AT donghyukim efficientldiversityalgorithmforpreservingprivacyofdynamicallypublisheddatasets
_version_ 1724189899443142656