HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity
The human microbiome is a collection of microorganisms. They form complex communities and collectively affect host health. Recently, the advances in next-generation sequencing technology enable the high-throughput profiling of the human microbiome. This calls for a statistical model to construct mic...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2020-06-01
|
Series: | Frontiers in Genetics |
Subjects: | |
Online Access: | https://www.frontiersin.org/article/10.3389/fgene.2020.00445/full |
id |
doaj-6e73e57eddb1485797d7cd93175a4680 |
---|---|
record_format |
Article |
spelling |
doaj-6e73e57eddb1485797d7cd93175a46802020-11-25T03:31:00ZengFrontiers Media S.A.Frontiers in Genetics1664-80212020-06-011110.3389/fgene.2020.00445520763HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting SparsityShuang Jiang0Shuang Jiang1Guanghua Xiao2Andrew Y. Koh3Yingfei Chen4Bo Yao5Qiwei Li6Xiaowei Zhan7Department of Statistical Science, Southern Methodist University, Dallas, TX, United StatesQuantitative Biomedical Research Center, Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX, United StatesQuantitative Biomedical Research Center, Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX, United StatesDepartments of Pediatrics, Departments of Microbiology, University of Texas Southwestern Medical Center, Dallas, TX, United StatesLyda Hill Department of Bioinformatics, Bioinformatics High Performance Computing, University of Texas Southwestern Medical Center, Dallas, TX, United StatesQuantitative Biomedical Research Center, Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX, United StatesDepartment of Mathematical Sciences, The University of Texas at Dallas, Richardson, TX, United StatesQuantitative Biomedical Research Center, Department of Population and Data Sciences, University of Texas Southwestern Medical Center, Dallas, TX, United StatesThe human microbiome is a collection of microorganisms. They form complex communities and collectively affect host health. Recently, the advances in next-generation sequencing technology enable the high-throughput profiling of the human microbiome. This calls for a statistical model to construct microbial networks from the microbiome sequencing count data. As microbiome count data are high-dimensional and suffer from uneven sampling depth, over-dispersion, and zero-inflation, these characteristics can bias the network estimation and require specialized analytical tools. Here we propose a general framework, HARMONIES, Hybrid Approach foR MicrobiOme Network Inferences via Exploiting Sparsity, to infer a sparse microbiome network. HARMONIES first utilizes a zero-inflated negative binomial (ZINB) distribution to model the skewness and excess zeros in the microbiome data, as well as incorporates a stochastic process prior for sample-wise normalization. This approach infers a sparse and stable network by imposing non-trivial regularizations based on the Gaussian graphical model. In comprehensive simulation studies, HARMONIES outperformed four other commonly used methods. When using published microbiome data from a colorectal cancer study, it discovered a novel community with disease-enriched bacteria. In summary, HARMONIES is a novel and useful statistical framework for microbiome network inference, and it is available at https://github.com/shuangj00/HARMONIES.https://www.frontiersin.org/article/10.3389/fgene.2020.00445/fullBayesian statisticsmicrobiome networkGaussian graphical modelDirichlet process priorhierarchical model |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Shuang Jiang Shuang Jiang Guanghua Xiao Andrew Y. Koh Yingfei Chen Bo Yao Qiwei Li Xiaowei Zhan |
spellingShingle |
Shuang Jiang Shuang Jiang Guanghua Xiao Andrew Y. Koh Yingfei Chen Bo Yao Qiwei Li Xiaowei Zhan HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity Frontiers in Genetics Bayesian statistics microbiome network Gaussian graphical model Dirichlet process prior hierarchical model |
author_facet |
Shuang Jiang Shuang Jiang Guanghua Xiao Andrew Y. Koh Yingfei Chen Bo Yao Qiwei Li Xiaowei Zhan |
author_sort |
Shuang Jiang |
title |
HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity |
title_short |
HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity |
title_full |
HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity |
title_fullStr |
HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity |
title_full_unstemmed |
HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity |
title_sort |
harmonies: a hybrid approach for microbiome networks inference via exploiting sparsity |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Genetics |
issn |
1664-8021 |
publishDate |
2020-06-01 |
description |
The human microbiome is a collection of microorganisms. They form complex communities and collectively affect host health. Recently, the advances in next-generation sequencing technology enable the high-throughput profiling of the human microbiome. This calls for a statistical model to construct microbial networks from the microbiome sequencing count data. As microbiome count data are high-dimensional and suffer from uneven sampling depth, over-dispersion, and zero-inflation, these characteristics can bias the network estimation and require specialized analytical tools. Here we propose a general framework, HARMONIES, Hybrid Approach foR MicrobiOme Network Inferences via Exploiting Sparsity, to infer a sparse microbiome network. HARMONIES first utilizes a zero-inflated negative binomial (ZINB) distribution to model the skewness and excess zeros in the microbiome data, as well as incorporates a stochastic process prior for sample-wise normalization. This approach infers a sparse and stable network by imposing non-trivial regularizations based on the Gaussian graphical model. In comprehensive simulation studies, HARMONIES outperformed four other commonly used methods. When using published microbiome data from a colorectal cancer study, it discovered a novel community with disease-enriched bacteria. In summary, HARMONIES is a novel and useful statistical framework for microbiome network inference, and it is available at https://github.com/shuangj00/HARMONIES. |
topic |
Bayesian statistics microbiome network Gaussian graphical model Dirichlet process prior hierarchical model |
url |
https://www.frontiersin.org/article/10.3389/fgene.2020.00445/full |
work_keys_str_mv |
AT shuangjiang harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity AT shuangjiang harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity AT guanghuaxiao harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity AT andrewykoh harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity AT yingfeichen harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity AT boyao harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity AT qiweili harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity AT xiaoweizhan harmoniesahybridapproachformicrobiomenetworksinferenceviaexploitingsparsity |
_version_ |
1724574237227745280 |