Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized Response
Privacy security issues under the classic randomized response (RR) model proposed by Warner and its extended K-RR model are studied. First, in order to provide references for the accuracy of the private distribution estimation problem under RR mechanism, the lower bounds of the differential privacy...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8618440/ |
id |
doaj-fcedce7d53dc4db996adc1b3d3ce73ce |
---|---|
record_format |
Article |
spelling |
doaj-fcedce7d53dc4db996adc1b3d3ce73ce2021-03-29T22:23:44ZengIEEEIEEE Access2169-35362019-01-017169641697810.1109/ACCESS.2019.28934078618440Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized ResponseHaina Song0https://orcid.org/0000-0001-6973-2267Tao Luo1Jianfeng Li2Beijing Laboratory of Advanced Information Networks, Beijing University of Posts and Telecommunications, Beijing, ChinaBeijing Laboratory of Advanced Information Networks, Beijing University of Posts and Telecommunications, Beijing, ChinaBeijing Laboratory of Advanced Information Networks, Beijing University of Posts and Telecommunications, Beijing, ChinaPrivacy security issues under the classic randomized response (RR) model proposed by Warner and its extended K-RR model are studied. First, in order to provide references for the accuracy of the private distribution estimation problem under RR mechanism, the lower bounds of the differential privacy parameter and the number of participates are deduced for an accuracy objective of (α, δ)-accurate in statistics. Second, when the prior distribution is nonuniform, the data utility has a ceiling effect in the high privacy region by taking the prior into account. In this case, the average distortion, which is defined as the expected Hamming distance between the input and output data, is no longer feasible to measure the data utility. Motivated by this, the error probability is proposed as a measure of data utility for unifying different privacy metrics, where the error probability is defined to be the expected Hamming distance between the input and reconstructed data based on maximum a posteriori estimation. Third, under a unified privacy-preserving framework using RR mechanism based on error probability criterion, the relationship among differential privacy, identifiability privacy, and mutual information privacy is established. Given a maximum allowable error probability P<sub>E</sub><sup>max</sup>, the optimal privacy parameters of these three privacy notions are derived with the full consideration of the prior distribution. Then, a Bayes-based utility function, which corresponds to the converse of the Bayes risk, is constructed to measure the degree of privacy leakage. Given a maximum allowable correct probability P<sub>C</sub><sup>max</sup>, the accuracy objective of the statistical estimate is considered to derive the range of local differential privacy parameter from the perspective of security. Fourth, all the research results above are further extended to K-RR mechanism too. Finally, the correctness and effectiveness are further verified by simulation experiments. The results reveal that the error probability can be applied to any prior distribution case, while the average distortion criterion is only a special case with uniform distribution. Therefore, the error probability proposed in the paper is more reasonable to be used as a common criterion to measure the data utility for the RR model so as to unify different privacy metrics.https://ieeexplore.ieee.org/document/8618440/Error probabilitymaximum a posteriori estimationlocal differential privacyrandomized responsedata utilityprivacy preservation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Haina Song Tao Luo Jianfeng Li |
spellingShingle |
Haina Song Tao Luo Jianfeng Li Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized Response IEEE Access Error probability maximum a posteriori estimation local differential privacy randomized response data utility privacy preservation |
author_facet |
Haina Song Tao Luo Jianfeng Li |
author_sort |
Haina Song |
title |
Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized Response |
title_short |
Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized Response |
title_full |
Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized Response |
title_fullStr |
Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized Response |
title_full_unstemmed |
Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized Response |
title_sort |
common criterion of privacy metrics and parameters analysis based on error probability for randomized response |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
Privacy security issues under the classic randomized response (RR) model proposed by Warner and its extended K-RR model are studied. First, in order to provide references for the accuracy of the private distribution estimation problem under RR mechanism, the lower bounds of the differential privacy parameter and the number of participates are deduced for an accuracy objective of (α, δ)-accurate in statistics. Second, when the prior distribution is nonuniform, the data utility has a ceiling effect in the high privacy region by taking the prior into account. In this case, the average distortion, which is defined as the expected Hamming distance between the input and output data, is no longer feasible to measure the data utility. Motivated by this, the error probability is proposed as a measure of data utility for unifying different privacy metrics, where the error probability is defined to be the expected Hamming distance between the input and reconstructed data based on maximum a posteriori estimation. Third, under a unified privacy-preserving framework using RR mechanism based on error probability criterion, the relationship among differential privacy, identifiability privacy, and mutual information privacy is established. Given a maximum allowable error probability P<sub>E</sub><sup>max</sup>, the optimal privacy parameters of these three privacy notions are derived with the full consideration of the prior distribution. Then, a Bayes-based utility function, which corresponds to the converse of the Bayes risk, is constructed to measure the degree of privacy leakage. Given a maximum allowable correct probability P<sub>C</sub><sup>max</sup>, the accuracy objective of the statistical estimate is considered to derive the range of local differential privacy parameter from the perspective of security. Fourth, all the research results above are further extended to K-RR mechanism too. Finally, the correctness and effectiveness are further verified by simulation experiments. The results reveal that the error probability can be applied to any prior distribution case, while the average distortion criterion is only a special case with uniform distribution. Therefore, the error probability proposed in the paper is more reasonable to be used as a common criterion to measure the data utility for the RR model so as to unify different privacy metrics. |
topic |
Error probability maximum a posteriori estimation local differential privacy randomized response data utility privacy preservation |
url |
https://ieeexplore.ieee.org/document/8618440/ |
work_keys_str_mv |
AT hainasong commoncriterionofprivacymetricsandparametersanalysisbasedonerrorprobabilityforrandomizedresponse AT taoluo commoncriterionofprivacymetricsandparametersanalysisbasedonerrorprobabilityforrandomizedresponse AT jianfengli commoncriterionofprivacymetricsandparametersanalysisbasedonerrorprobabilityforrandomizedresponse |
_version_ |
1724191803086733312 |