Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy
Data releasing is a key part bridging between the collection of big data and their applications. Traditional methods release the static version of dataset or publish the snapshot with a fixed sampling interval, which cannot meet the dynamic query requirements and query precision for big data. Moreov...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8890933/ |
id |
doaj-2daebeed4fe54e0e801ed0c91411c9be |
---|---|
record_format |
Article |
spelling |
doaj-2daebeed4fe54e0e801ed0c91411c9be2021-03-30T00:31:19ZengIEEEIEEE Access2169-35362019-01-01716496216497410.1109/ACCESS.2019.29513648890933Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential PrivacyYan Yan0https://orcid.org/0000-0002-2885-9867Lianxiu Zhang1Quan Z. Sheng2Bingqian Wang3Xin Gao4Yiming Cong5School of Computer and Communication, Lanzhou University of Technology, Lanzhou, ChinaSchool of Computer and Communication, Lanzhou University of Technology, Lanzhou, ChinaDepartment of Computing, Faculty of Science and Engineering, Macquarie University, Sydney, NSW, AustraliaSchool of Computer and Communication, Lanzhou University of Technology, Lanzhou, ChinaSchool of Computer and Communication, Lanzhou University of Technology, Lanzhou, ChinaSchool of Computer and Communication, Lanzhou University of Technology, Lanzhou, ChinaData releasing is a key part bridging between the collection of big data and their applications. Traditional methods release the static version of dataset or publish the snapshot with a fixed sampling interval, which cannot meet the dynamic query requirements and query precision for big data. Moreover, the quality of published data cannot reflect the characteristics of the dynamic changes of big data, which often leads to subsequent data analysis and mining errors. This paper proposes an adaptive sampling mechanism and privacy protection method for the release of big location data. In order to reflect the dynamic change of data in time, we design an adaptive sampling mechanism based on the proportional-integral-derivative (PID) controller according to the temporal and spatial correlation of the location data. To ensure the privacy of published data, we propose a heuristic quad-tree partitioning method as well as a corresponding privacy budget allocation strategy. Experiments and analysis prove that the adaptive sampling mechanism proposed in this paper can effectively track the trend of dynamic changes of data, and the designed differential privacy method can improve the accuracy of counting query and enhance the availability of published data under the premise of certain privacy intensity. The proposed methods can also be readily extended to other areas of big data release applications.https://ieeexplore.ieee.org/document/8890933/Big location dataprivacy preserving data publishingadaptive samplingdifferential privacyheuristic quad-tree partitioning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yan Yan Lianxiu Zhang Quan Z. Sheng Bingqian Wang Xin Gao Yiming Cong |
spellingShingle |
Yan Yan Lianxiu Zhang Quan Z. Sheng Bingqian Wang Xin Gao Yiming Cong Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy IEEE Access Big location data privacy preserving data publishing adaptive sampling differential privacy heuristic quad-tree partitioning |
author_facet |
Yan Yan Lianxiu Zhang Quan Z. Sheng Bingqian Wang Xin Gao Yiming Cong |
author_sort |
Yan Yan |
title |
Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy |
title_short |
Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy |
title_full |
Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy |
title_fullStr |
Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy |
title_full_unstemmed |
Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy |
title_sort |
dynamic release of big location data based on adaptive sampling and differential privacy |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
Data releasing is a key part bridging between the collection of big data and their applications. Traditional methods release the static version of dataset or publish the snapshot with a fixed sampling interval, which cannot meet the dynamic query requirements and query precision for big data. Moreover, the quality of published data cannot reflect the characteristics of the dynamic changes of big data, which often leads to subsequent data analysis and mining errors. This paper proposes an adaptive sampling mechanism and privacy protection method for the release of big location data. In order to reflect the dynamic change of data in time, we design an adaptive sampling mechanism based on the proportional-integral-derivative (PID) controller according to the temporal and spatial correlation of the location data. To ensure the privacy of published data, we propose a heuristic quad-tree partitioning method as well as a corresponding privacy budget allocation strategy. Experiments and analysis prove that the adaptive sampling mechanism proposed in this paper can effectively track the trend of dynamic changes of data, and the designed differential privacy method can improve the accuracy of counting query and enhance the availability of published data under the premise of certain privacy intensity. The proposed methods can also be readily extended to other areas of big data release applications. |
topic |
Big location data privacy preserving data publishing adaptive sampling differential privacy heuristic quad-tree partitioning |
url |
https://ieeexplore.ieee.org/document/8890933/ |
work_keys_str_mv |
AT yanyan dynamicreleaseofbiglocationdatabasedonadaptivesamplinganddifferentialprivacy AT lianxiuzhang dynamicreleaseofbiglocationdatabasedonadaptivesamplinganddifferentialprivacy AT quanzsheng dynamicreleaseofbiglocationdatabasedonadaptivesamplinganddifferentialprivacy AT bingqianwang dynamicreleaseofbiglocationdatabasedonadaptivesamplinganddifferentialprivacy AT xingao dynamicreleaseofbiglocationdatabasedonadaptivesamplinganddifferentialprivacy AT yimingcong dynamicreleaseofbiglocationdatabasedonadaptivesamplinganddifferentialprivacy |
_version_ |
1724188203049549824 |