Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized Response

Privacy security issues under the classic randomized response (RR) model proposed by Warner and its extended K-RR model are studied. First, in order to provide references for the accuracy of the private distribution estimation problem under RR mechanism, the lower bounds of the differential privacy...

Full description

Bibliographic Details
Main Authors: Haina Song, Tao Luo, Jianfeng Li
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8618440/
id doaj-fcedce7d53dc4db996adc1b3d3ce73ce
record_format Article
spelling doaj-fcedce7d53dc4db996adc1b3d3ce73ce2021-03-29T22:23:44ZengIEEEIEEE Access2169-35362019-01-017169641697810.1109/ACCESS.2019.28934078618440Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized ResponseHaina Song0https://orcid.org/0000-0001-6973-2267Tao Luo1Jianfeng Li2Beijing Laboratory of Advanced Information Networks, Beijing University of Posts and Telecommunications, Beijing, ChinaBeijing Laboratory of Advanced Information Networks, Beijing University of Posts and Telecommunications, Beijing, ChinaBeijing Laboratory of Advanced Information Networks, Beijing University of Posts and Telecommunications, Beijing, ChinaPrivacy security issues under the classic randomized response (RR) model proposed by Warner and its extended K-RR model are studied. First, in order to provide references for the accuracy of the private distribution estimation problem under RR mechanism, the lower bounds of the differential privacy parameter and the number of participates are deduced for an accuracy objective of (&#x03B1;, &#x03B4;)-accurate in statistics. Second, when the prior distribution is nonuniform, the data utility has a ceiling effect in the high privacy region by taking the prior into account. In this case, the average distortion, which is defined as the expected Hamming distance between the input and output data, is no longer feasible to measure the data utility. Motivated by this, the error probability is proposed as a measure of data utility for unifying different privacy metrics, where the error probability is defined to be the expected Hamming distance between the input and reconstructed data based on maximum a posteriori estimation. Third, under a unified privacy-preserving framework using RR mechanism based on error probability criterion, the relationship among differential privacy, identifiability privacy, and mutual information privacy is established. Given a maximum allowable error probability P<sub>E</sub><sup>max</sup>, the optimal privacy parameters of these three privacy notions are derived with the full consideration of the prior distribution. Then, a Bayes-based utility function, which corresponds to the converse of the Bayes risk, is constructed to measure the degree of privacy leakage. Given a maximum allowable correct probability P<sub>C</sub><sup>max</sup>, the accuracy objective of the statistical estimate is considered to derive the range of local differential privacy parameter from the perspective of security. Fourth, all the research results above are further extended to K-RR mechanism too. Finally, the correctness and effectiveness are further verified by simulation experiments. The results reveal that the error probability can be applied to any prior distribution case, while the average distortion criterion is only a special case with uniform distribution. Therefore, the error probability proposed in the paper is more reasonable to be used as a common criterion to measure the data utility for the RR model so as to unify different privacy metrics.https://ieeexplore.ieee.org/document/8618440/Error probabilitymaximum a posteriori estimationlocal differential privacyrandomized responsedata utilityprivacy preservation
collection DOAJ
language English
format Article
sources DOAJ
author Haina Song
Tao Luo
Jianfeng Li
spellingShingle Haina Song
Tao Luo
Jianfeng Li
Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized Response
IEEE Access
Error probability
maximum a posteriori estimation
local differential privacy
randomized response
data utility
privacy preservation
author_facet Haina Song
Tao Luo
Jianfeng Li
author_sort Haina Song
title Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized Response
title_short Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized Response
title_full Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized Response
title_fullStr Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized Response
title_full_unstemmed Common Criterion of Privacy Metrics and Parameters Analysis Based on Error Probability for Randomized Response
title_sort common criterion of privacy metrics and parameters analysis based on error probability for randomized response
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Privacy security issues under the classic randomized response (RR) model proposed by Warner and its extended K-RR model are studied. First, in order to provide references for the accuracy of the private distribution estimation problem under RR mechanism, the lower bounds of the differential privacy parameter and the number of participates are deduced for an accuracy objective of (&#x03B1;, &#x03B4;)-accurate in statistics. Second, when the prior distribution is nonuniform, the data utility has a ceiling effect in the high privacy region by taking the prior into account. In this case, the average distortion, which is defined as the expected Hamming distance between the input and output data, is no longer feasible to measure the data utility. Motivated by this, the error probability is proposed as a measure of data utility for unifying different privacy metrics, where the error probability is defined to be the expected Hamming distance between the input and reconstructed data based on maximum a posteriori estimation. Third, under a unified privacy-preserving framework using RR mechanism based on error probability criterion, the relationship among differential privacy, identifiability privacy, and mutual information privacy is established. Given a maximum allowable error probability P<sub>E</sub><sup>max</sup>, the optimal privacy parameters of these three privacy notions are derived with the full consideration of the prior distribution. Then, a Bayes-based utility function, which corresponds to the converse of the Bayes risk, is constructed to measure the degree of privacy leakage. Given a maximum allowable correct probability P<sub>C</sub><sup>max</sup>, the accuracy objective of the statistical estimate is considered to derive the range of local differential privacy parameter from the perspective of security. Fourth, all the research results above are further extended to K-RR mechanism too. Finally, the correctness and effectiveness are further verified by simulation experiments. The results reveal that the error probability can be applied to any prior distribution case, while the average distortion criterion is only a special case with uniform distribution. Therefore, the error probability proposed in the paper is more reasonable to be used as a common criterion to measure the data utility for the RR model so as to unify different privacy metrics.
topic Error probability
maximum a posteriori estimation
local differential privacy
randomized response
data utility
privacy preservation
url https://ieeexplore.ieee.org/document/8618440/
work_keys_str_mv AT hainasong commoncriterionofprivacymetricsandparametersanalysisbasedonerrorprobabilityforrandomizedresponse
AT taoluo commoncriterionofprivacymetricsandparametersanalysisbasedonerrorprobabilityforrandomizedresponse
AT jianfengli commoncriterionofprivacymetricsandparametersanalysisbasedonerrorprobabilityforrandomizedresponse
_version_ 1724191803086733312