Reconstructing Networks with Unknown and Heterogeneous Errors

The vast majority of network data sets contain errors and omissions, although this fact is rarely incorporated in traditional network analysis. Recently, an increasing effort has been made to fill this methodological gap by developing network-reconstruction approaches based on Bayesian inference. Th...

Full description

Bibliographic Details
Main Author: Tiago P. Peixoto
Format: Article
Language:English
Published: American Physical Society 2018-10-01
Series:Physical Review X
Online Access:http://doi.org/10.1103/PhysRevX.8.041011
id doaj-5aa6571fa0a34b1eb53c3648e5f66429
record_format Article
spelling doaj-5aa6571fa0a34b1eb53c3648e5f664292020-11-24T22:03:16ZengAmerican Physical SocietyPhysical Review X2160-33082018-10-018404101110.1103/PhysRevX.8.041011Reconstructing Networks with Unknown and Heterogeneous ErrorsTiago P. PeixotoThe vast majority of network data sets contain errors and omissions, although this fact is rarely incorporated in traditional network analysis. Recently, an increasing effort has been made to fill this methodological gap by developing network-reconstruction approaches based on Bayesian inference. These approaches, however, rely on assumptions of uniform error rates and on direct estimations of the existence of each edge via repeated measurements, something that is currently unavailable for the majority of network data. Here, we develop a Bayesian reconstruction approach that lifts these limitations by allowing for not only heterogeneous errors, but also for single edge measurements without direct error estimates. Our approach works by coupling the inference approach with structured generative network models, which enable the correlations between edges to be used as reliable uncertainty estimates. Although our approach is general, we focus on the stochastic block model as the basic generative process, from which efficient nonparametric inference can be performed and yields a principled method to infer hierarchical community structure from noisy data. We demonstrate the efficacy of our approach with a variety of empirical and artificial networks.http://doi.org/10.1103/PhysRevX.8.041011
collection DOAJ
language English
format Article
sources DOAJ
author Tiago P. Peixoto
spellingShingle Tiago P. Peixoto
Reconstructing Networks with Unknown and Heterogeneous Errors
Physical Review X
author_facet Tiago P. Peixoto
author_sort Tiago P. Peixoto
title Reconstructing Networks with Unknown and Heterogeneous Errors
title_short Reconstructing Networks with Unknown and Heterogeneous Errors
title_full Reconstructing Networks with Unknown and Heterogeneous Errors
title_fullStr Reconstructing Networks with Unknown and Heterogeneous Errors
title_full_unstemmed Reconstructing Networks with Unknown and Heterogeneous Errors
title_sort reconstructing networks with unknown and heterogeneous errors
publisher American Physical Society
series Physical Review X
issn 2160-3308
publishDate 2018-10-01
description The vast majority of network data sets contain errors and omissions, although this fact is rarely incorporated in traditional network analysis. Recently, an increasing effort has been made to fill this methodological gap by developing network-reconstruction approaches based on Bayesian inference. These approaches, however, rely on assumptions of uniform error rates and on direct estimations of the existence of each edge via repeated measurements, something that is currently unavailable for the majority of network data. Here, we develop a Bayesian reconstruction approach that lifts these limitations by allowing for not only heterogeneous errors, but also for single edge measurements without direct error estimates. Our approach works by coupling the inference approach with structured generative network models, which enable the correlations between edges to be used as reliable uncertainty estimates. Although our approach is general, we focus on the stochastic block model as the basic generative process, from which efficient nonparametric inference can be performed and yields a principled method to infer hierarchical community structure from noisy data. We demonstrate the efficacy of our approach with a variety of empirical and artificial networks.
url http://doi.org/10.1103/PhysRevX.8.041011
work_keys_str_mv AT tiagoppeixoto reconstructingnetworkswithunknownandheterogeneouserrors
_version_ 1716603938903425024