Enabling accurate analysis of private network data

This dissertation addresses the challenge of enabling accurate analysis of network data while ensuring the protection of network participants' privacy. This is an important problem: massive amounts of data are being collected (facebook activity, email correspondence, cell phone records), there...

Full description

Bibliographic Details
Main Author: Hay, Michael G
Language:ENG
Published: ScholarWorks@UMass Amherst 2010
Subjects:
Online Access:https://scholarworks.umass.edu/dissertations/AAI3427533
id ndltd-UMASS-oai-scholarworks.umass.edu-dissertations-6113
record_format oai_dc
spelling ndltd-UMASS-oai-scholarworks.umass.edu-dissertations-61132020-12-02T14:32:13Z Enabling accurate analysis of private network data Hay, Michael G This dissertation addresses the challenge of enabling accurate analysis of network data while ensuring the protection of network participants' privacy. This is an important problem: massive amounts of data are being collected (facebook activity, email correspondence, cell phone records), there is huge interest in analyzing the data, but the data is not being shared due to concerns about privacy. Despite much research in privacy-preserving data analysis, existing technologies fail to provide a solution because they were designed for tables, not networks, and cannot be easily adapted to handle the complexities of network data. We develop several technologies that advance us toward our goal. First, we develop a framework for assessing the risk of publishing a network that has been “anonymized.” Using this framework, we show that only a small amount of background knowledge about local network structure is needed to re-identify an “anonymous” individual. This motivates our second contribution: an algorithm that transforms the structure of the network to provably lower re-identification risk. In comparison with other algorithms, we show that our approach more accurately preserves important features of the network topology. Finally, we consider an alternative paradigm, in which the analyst can analyze private data through a carefully controlled query interface. We show that the degree sequence of a network can be accurately estimated under strong guarantees of privacy. 2010-01-01T08:00:00Z text https://scholarworks.umass.edu/dissertations/AAI3427533 Doctoral Dissertations Available from Proquest ENG ScholarWorks@UMass Amherst Computer science
collection NDLTD
language ENG
sources NDLTD
topic Computer science
spellingShingle Computer science
Hay, Michael G
Enabling accurate analysis of private network data
description This dissertation addresses the challenge of enabling accurate analysis of network data while ensuring the protection of network participants' privacy. This is an important problem: massive amounts of data are being collected (facebook activity, email correspondence, cell phone records), there is huge interest in analyzing the data, but the data is not being shared due to concerns about privacy. Despite much research in privacy-preserving data analysis, existing technologies fail to provide a solution because they were designed for tables, not networks, and cannot be easily adapted to handle the complexities of network data. We develop several technologies that advance us toward our goal. First, we develop a framework for assessing the risk of publishing a network that has been “anonymized.” Using this framework, we show that only a small amount of background knowledge about local network structure is needed to re-identify an “anonymous” individual. This motivates our second contribution: an algorithm that transforms the structure of the network to provably lower re-identification risk. In comparison with other algorithms, we show that our approach more accurately preserves important features of the network topology. Finally, we consider an alternative paradigm, in which the analyst can analyze private data through a carefully controlled query interface. We show that the degree sequence of a network can be accurately estimated under strong guarantees of privacy.
author Hay, Michael G
author_facet Hay, Michael G
author_sort Hay, Michael G
title Enabling accurate analysis of private network data
title_short Enabling accurate analysis of private network data
title_full Enabling accurate analysis of private network data
title_fullStr Enabling accurate analysis of private network data
title_full_unstemmed Enabling accurate analysis of private network data
title_sort enabling accurate analysis of private network data
publisher ScholarWorks@UMass Amherst
publishDate 2010
url https://scholarworks.umass.edu/dissertations/AAI3427533
work_keys_str_mv AT haymichaelg enablingaccurateanalysisofprivatenetworkdata
_version_ 1719364417344241664