HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity

The human microbiome is a collection of microorganisms. They form complex communities and collectively affect host health. Recently, the advances in next-generation sequencing technology enable the high-throughput profiling of the human microbiome. This calls for a statistical model to construct mic...

Full description

Bibliographic Details
Main Authors: Shuang Jiang, Guanghua Xiao, Andrew Y. Koh, Yingfei Chen, Bo Yao, Qiwei Li, Xiaowei Zhan
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-06-01
Series:Frontiers in Genetics
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/fgene.2020.00445/full
id doaj-6e73e57eddb1485797d7cd93175a4680
record_format Article
spelling doaj-6e73e57eddb1485797d7cd93175a46802020-11-25T03:31:00ZengFrontiers Media S.A.Frontiers in Genetics1664-80212020-06-011110.3389/fgene.2020.00445520763HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting SparsityShuang Jiang0Shuang Jiang1Guanghua Xiao2Andrew Y. Koh3Yingfei Chen4Bo Yao5Qiwei Li6Xiaowei Zhan7Department of Statistical Science, Southern Methodist University, Dallas, TX, United StatesQuantitative Biomedical Research Center, Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX, United StatesQuantitative Biomedical Research Center, Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX, United StatesDepartments of Pediatrics, Departments of Microbiology, University of Texas Southwestern Medical Center, Dallas, TX, United StatesLyda Hill Department of Bioinformatics, Bioinformatics High Performance Computing, University of Texas Southwestern Medical Center, Dallas, TX, United StatesQuantitative Biomedical Research Center, Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX, United StatesDepartment of Mathematical Sciences, The University of Texas at Dallas, Richardson, TX, United StatesQuantitative Biomedical Research Center, Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX, United StatesThe human microbiome is a collection of microorganisms. They form complex communities and collectively affect host health. Recently, the advances in next-generation sequencing technology enable the high-throughput profiling of the human microbiome. This calls for a statistical model to construct microbial networks from the microbiome sequencing count data. As microbiome count data are high-dimensional and suffer from uneven sampling depth, over-dispersion, and zero-inflation, these characteristics can bias the network estimation and require specialized analytical tools. Here we propose a general framework, HARMONIES, Hybrid Approach foR MicrobiOme Network Inferences via Exploiting Sparsity, to infer a sparse microbiome network. HARMONIES first utilizes a zero-inflated negative binomial (ZINB) distribution to model the skewness and excess zeros in the microbiome data, as well as incorporates a stochastic process prior for sample-wise normalization. This approach infers a sparse and stable network by imposing non-trivial regularizations based on the Gaussian graphical model. In comprehensive simulation studies, HARMONIES outperformed four other commonly used methods. When using published microbiome data from a colorectal cancer study, it discovered a novel community with disease-enriched bacteria. In summary, HARMONIES is a novel and useful statistical framework for microbiome network inference, and it is available at https://github.com/shuangj00/HARMONIES.https://www.frontiersin.org/article/10.3389/fgene.2020.00445/fullBayesian statisticsmicrobiome networkGaussian graphical modelDirichlet process priorhierarchical model
collection DOAJ
language English
format Article
sources DOAJ
author Shuang Jiang
Shuang Jiang
Guanghua Xiao
Andrew Y. Koh
Yingfei Chen
Bo Yao
Qiwei Li
Xiaowei Zhan
spellingShingle Shuang Jiang
Shuang Jiang
Guanghua Xiao
Andrew Y. Koh
Yingfei Chen
Bo Yao
Qiwei Li
Xiaowei Zhan
HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity
Frontiers in Genetics
Bayesian statistics
microbiome network
Gaussian graphical model
Dirichlet process prior
hierarchical model
author_facet Shuang Jiang
Shuang Jiang
Guanghua Xiao
Andrew Y. Koh
Yingfei Chen
Bo Yao
Qiwei Li
Xiaowei Zhan
author_sort Shuang Jiang
title HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity
title_short HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity
title_full HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity
title_fullStr HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity
title_full_unstemmed HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity
title_sort harmonies: a hybrid approach for microbiome networks inference via exploiting sparsity
publisher Frontiers Media S.A.
series Frontiers in Genetics
issn 1664-8021
publishDate 2020-06-01
description The human microbiome is a collection of microorganisms. They form complex communities and collectively affect host health. Recently, the advances in next-generation sequencing technology enable the high-throughput profiling of the human microbiome. This calls for a statistical model to construct microbial networks from the microbiome sequencing count data. As microbiome count data are high-dimensional and suffer from uneven sampling depth, over-dispersion, and zero-inflation, these characteristics can bias the network estimation and require specialized analytical tools. Here we propose a general framework, HARMONIES, Hybrid Approach foR MicrobiOme Network Inferences via Exploiting Sparsity, to infer a sparse microbiome network. HARMONIES first utilizes a zero-inflated negative binomial (ZINB) distribution to model the skewness and excess zeros in the microbiome data, as well as incorporates a stochastic process prior for sample-wise normalization. This approach infers a sparse and stable network by imposing non-trivial regularizations based on the Gaussian graphical model. In comprehensive simulation studies, HARMONIES outperformed four other commonly used methods. When using published microbiome data from a colorectal cancer study, it discovered a novel community with disease-enriched bacteria. In summary, HARMONIES is a novel and useful statistical framework for microbiome network inference, and it is available at https://github.com/shuangj00/HARMONIES.
topic Bayesian statistics
microbiome network
Gaussian graphical model
Dirichlet process prior
hierarchical model
url https://www.frontiersin.org/article/10.3389/fgene.2020.00445/full
work_keys_str_mv AT shuangjiang harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity
AT shuangjiang harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity
AT guanghuaxiao harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity
AT andrewykoh harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity
AT yingfeichen harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity
AT boyao harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity
AT qiweili harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity
AT xiaoweizhan harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity
_version_ 1724574237227745280