Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published Datasets
Although most conventional methods of preserving data privacy focus on static datasets, which remain unchanged after processing, real-world datasets may be dynamically modified often. Therefore, privacy-preservation methods must maintain data privacy after dataset modification. Re-anonymization of e...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8805309/ |
id |
doaj-9b6a433496844065b0192dd41a19fb70 |
---|---|
record_format |
Article |
spelling |
doaj-9b6a433496844065b0192dd41a19fb702021-03-29T23:14:23ZengIEEEIEEE Access2169-35362019-01-01712287812288810.1109/ACCESS.2019.29363018805309Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published DatasetsOdsuren Temuujin0Jinhyun Ahn1https://orcid.org/0000-0002-2331-004XDong-Hyuk Im2https://orcid.org/0000-0002-0290-755XDepartment of Computer and Information Engineering, Hoseo University, Asan, South KoreaDepartment of Management Information Systems, Jeju National University, Jeju, South KoreaDepartment of Computer and Information Engineering, Hoseo University, Asan, South KoreaAlthough most conventional methods of preserving data privacy focus on static datasets, which remain unchanged after processing, real-world datasets may be dynamically modified often. Therefore, privacy-preservation methods must maintain data privacy after dataset modification. Re-anonymization of entire datasets is inefficient when large datasets are frequently modified. Although several previous studies have addressed data privacy for incremental data updates (i.e., record insertions), they have not adequately it for dynamic changes made to existing datasets (i.e., record updates and deletions). Therefore, we identified limitations of data-privacy preservation for dynamically evolving datasets and used anatomy instead of generalization and suppression to develop a more efficient l-diversity algorithm for preserving privacy of such datasets. We also used a Cuckoo filter, a new probabilistic data structure for approximate set-membership tests, to improve data-processing efficiency. Experimental results demonstrated that our proposed data-anonymization algorithm processed data more efficiently than other conventional algorithms, requiring much less running time than conventional re-anonymization of entire datasets. The Cuckoo-filtered algorithm was especially efficient, dramatically reducing operation execution times while maintaining privacy of dynamically evolving datasets.https://ieeexplore.ieee.org/document/8805309/AnonymizationCuckoo filterdynamic data publishingl-diversityprivacy-preservation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Odsuren Temuujin Jinhyun Ahn Dong-Hyuk Im |
spellingShingle |
Odsuren Temuujin Jinhyun Ahn Dong-Hyuk Im Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published Datasets IEEE Access Anonymization Cuckoo filter dynamic data publishing l-diversity privacy-preservation |
author_facet |
Odsuren Temuujin Jinhyun Ahn Dong-Hyuk Im |
author_sort |
Odsuren Temuujin |
title |
Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published Datasets |
title_short |
Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published Datasets |
title_full |
Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published Datasets |
title_fullStr |
Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published Datasets |
title_full_unstemmed |
Efficient L-Diversity Algorithm for Preserving Privacy of Dynamically Published Datasets |
title_sort |
efficient l-diversity algorithm for preserving privacy of dynamically published datasets |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
Although most conventional methods of preserving data privacy focus on static datasets, which remain unchanged after processing, real-world datasets may be dynamically modified often. Therefore, privacy-preservation methods must maintain data privacy after dataset modification. Re-anonymization of entire datasets is inefficient when large datasets are frequently modified. Although several previous studies have addressed data privacy for incremental data updates (i.e., record insertions), they have not adequately it for dynamic changes made to existing datasets (i.e., record updates and deletions). Therefore, we identified limitations of data-privacy preservation for dynamically evolving datasets and used anatomy instead of generalization and suppression to develop a more efficient l-diversity algorithm for preserving privacy of such datasets. We also used a Cuckoo filter, a new probabilistic data structure for approximate set-membership tests, to improve data-processing efficiency. Experimental results demonstrated that our proposed data-anonymization algorithm processed data more efficiently than other conventional algorithms, requiring much less running time than conventional re-anonymization of entire datasets. The Cuckoo-filtered algorithm was especially efficient, dramatically reducing operation execution times while maintaining privacy of dynamically evolving datasets. |
topic |
Anonymization Cuckoo filter dynamic data publishing l-diversity privacy-preservation |
url |
https://ieeexplore.ieee.org/document/8805309/ |
work_keys_str_mv |
AT odsurentemuujin efficientldiversityalgorithmforpreservingprivacyofdynamicallypublisheddatasets AT jinhyunahn efficientldiversityalgorithmforpreservingprivacyofdynamicallypublisheddatasets AT donghyukim efficientldiversityalgorithmforpreservingprivacyofdynamicallypublisheddatasets |
_version_ |
1724189899443142656 |