Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy

Data releasing is a key part bridging between the collection of big data and their applications. Traditional methods release the static version of dataset or publish the snapshot with a fixed sampling interval, which cannot meet the dynamic query requirements and query precision for big data. Moreov...

Full description

Bibliographic Details
Main Authors: Yan Yan, Lianxiu Zhang, Quan Z. Sheng, Bingqian Wang, Xin Gao, Yiming Cong
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8890933/
id doaj-2daebeed4fe54e0e801ed0c91411c9be
record_format Article
spelling doaj-2daebeed4fe54e0e801ed0c91411c9be2021-03-30T00:31:19ZengIEEEIEEE Access2169-35362019-01-01716496216497410.1109/ACCESS.2019.29513648890933Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential PrivacyYan Yan0https://orcid.org/0000-0002-2885-9867Lianxiu Zhang1Quan Z. Sheng2Bingqian Wang3Xin Gao4Yiming Cong5School of Computer and Communication, Lanzhou University of Technology, Lanzhou, ChinaSchool of Computer and Communication, Lanzhou University of Technology, Lanzhou, ChinaDepartment of Computing, Faculty of Science and Engineering, Macquarie University, Sydney, NSW, AustraliaSchool of Computer and Communication, Lanzhou University of Technology, Lanzhou, ChinaSchool of Computer and Communication, Lanzhou University of Technology, Lanzhou, ChinaSchool of Computer and Communication, Lanzhou University of Technology, Lanzhou, ChinaData releasing is a key part bridging between the collection of big data and their applications. Traditional methods release the static version of dataset or publish the snapshot with a fixed sampling interval, which cannot meet the dynamic query requirements and query precision for big data. Moreover, the quality of published data cannot reflect the characteristics of the dynamic changes of big data, which often leads to subsequent data analysis and mining errors. This paper proposes an adaptive sampling mechanism and privacy protection method for the release of big location data. In order to reflect the dynamic change of data in time, we design an adaptive sampling mechanism based on the proportional-integral-derivative (PID) controller according to the temporal and spatial correlation of the location data. To ensure the privacy of published data, we propose a heuristic quad-tree partitioning method as well as a corresponding privacy budget allocation strategy. Experiments and analysis prove that the adaptive sampling mechanism proposed in this paper can effectively track the trend of dynamic changes of data, and the designed differential privacy method can improve the accuracy of counting query and enhance the availability of published data under the premise of certain privacy intensity. The proposed methods can also be readily extended to other areas of big data release applications.https://ieeexplore.ieee.org/document/8890933/Big location dataprivacy preserving data publishingadaptive samplingdifferential privacyheuristic quad-tree partitioning
collection DOAJ
language English
format Article
sources DOAJ
author Yan Yan
Lianxiu Zhang
Quan Z. Sheng
Bingqian Wang
Xin Gao
Yiming Cong
spellingShingle Yan Yan
Lianxiu Zhang
Quan Z. Sheng
Bingqian Wang
Xin Gao
Yiming Cong
Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy
IEEE Access
Big location data
privacy preserving data publishing
adaptive sampling
differential privacy
heuristic quad-tree partitioning
author_facet Yan Yan
Lianxiu Zhang
Quan Z. Sheng
Bingqian Wang
Xin Gao
Yiming Cong
author_sort Yan Yan
title Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy
title_short Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy
title_full Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy
title_fullStr Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy
title_full_unstemmed Dynamic Release of Big Location Data Based on Adaptive Sampling and Differential Privacy
title_sort dynamic release of big location data based on adaptive sampling and differential privacy
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Data releasing is a key part bridging between the collection of big data and their applications. Traditional methods release the static version of dataset or publish the snapshot with a fixed sampling interval, which cannot meet the dynamic query requirements and query precision for big data. Moreover, the quality of published data cannot reflect the characteristics of the dynamic changes of big data, which often leads to subsequent data analysis and mining errors. This paper proposes an adaptive sampling mechanism and privacy protection method for the release of big location data. In order to reflect the dynamic change of data in time, we design an adaptive sampling mechanism based on the proportional-integral-derivative (PID) controller according to the temporal and spatial correlation of the location data. To ensure the privacy of published data, we propose a heuristic quad-tree partitioning method as well as a corresponding privacy budget allocation strategy. Experiments and analysis prove that the adaptive sampling mechanism proposed in this paper can effectively track the trend of dynamic changes of data, and the designed differential privacy method can improve the accuracy of counting query and enhance the availability of published data under the premise of certain privacy intensity. The proposed methods can also be readily extended to other areas of big data release applications.
topic Big location data
privacy preserving data publishing
adaptive sampling
differential privacy
heuristic quad-tree partitioning
url https://ieeexplore.ieee.org/document/8890933/
work_keys_str_mv AT yanyan dynamicreleaseofbiglocationdatabasedonadaptivesamplinganddifferentialprivacy
AT lianxiuzhang dynamicreleaseofbiglocationdatabasedonadaptivesamplinganddifferentialprivacy
AT quanzsheng dynamicreleaseofbiglocationdatabasedonadaptivesamplinganddifferentialprivacy
AT bingqianwang dynamicreleaseofbiglocationdatabasedonadaptivesamplinganddifferentialprivacy
AT xingao dynamicreleaseofbiglocationdatabasedonadaptivesamplinganddifferentialprivacy
AT yimingcong dynamicreleaseofbiglocationdatabasedonadaptivesamplinganddifferentialprivacy
_version_ 1724188203049549824