metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models

Abstract Background The rapid growth of high-throughput sequencing-based microbiome profiling has yielded tremendous insights into human health and physiology. Data generated from high-throughput sequencing of 16S rRNA gene amplicons are often preprocessed into composition or relative abundance. How...

Full description

Bibliographic Details
Main Authors: Nhan Thi Ho, Fan Li, Shuang Wang, Louise Kuhn
Format: Article
Language:English
Published: BMC 2019-04-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-019-2744-2
id doaj-502ed0e4c13449d6b17e54c810084f48
record_format Article
spelling doaj-502ed0e4c13449d6b17e54c810084f482020-11-25T02:23:37ZengBMCBMC Bioinformatics1471-21052019-04-0120111510.1186/s12859-019-2744-2metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects modelsNhan Thi Ho0Fan Li1Shuang Wang2Louise Kuhn3Gertrude H. Sergievsky Center, Columbia UniversityDepartment of Pediatrics, University of CaliforniaDepartment of Biostatistics, Mailman School of Public Health, Columbia UniversityGertrude H. Sergievsky Center, Columbia UniversityAbstract Background The rapid growth of high-throughput sequencing-based microbiome profiling has yielded tremendous insights into human health and physiology. Data generated from high-throughput sequencing of 16S rRNA gene amplicons are often preprocessed into composition or relative abundance. However, reproducibility has been lacking due to the myriad of different experimental and computational approaches taken in these studies. Microbiome studies may report varying results on the same topic, therefore, meta-analyses examining different microbiome studies to provide consistent and robust results are important. So far, there is still a lack of implemented methods to properly examine differential relative abundances of microbial taxonomies and to perform meta-analysis examining the heterogeneity and overall effects across microbiome studies. Results We developed an R package ‘metamicrobiomeR’ that applies Generalized Additive Models for Location, Scale and Shape (GAMLSS) with a zero-inflated beta (BEZI) family (GAMLSS-BEZI) for analysis of microbiome relative abundance datasets. Both simulation studies and application to real microbiome data demonstrate that GAMLSS-BEZI well performs in testing differential relative abundances of microbial taxonomies. Importantly, the estimates from GAMLSS-BEZI are log (odds ratio) of relative abundances between comparison groups and thus are analogous between microbiome studies. As such, we also apply random effects meta-analysis models to pool estimates and their standard errors across microbiome studies. We demonstrate the meta-analysis examples and highlight the utility of our package on four studies comparing gut microbiomes between male and female infants in the first six months of life. Conclusions GAMLSS-BEZI allows proper examination of microbiome relative abundance data. Random effects meta-analysis models can be directly applied to pool comparable estimates and their standard errors to evaluate the overall effects and heterogeneity across microbiome studies. The examples and workflow using our ‘metamicrobiomeR’ package are reproducible and applicable for the analyses and meta-analyses of other microbiome studies.http://link.springer.com/article/10.1186/s12859-019-2744-2MicrobiomeRelative abundanceGAMLSSZero-inflated betaMeta-analysisRandom effect
collection DOAJ
language English
format Article
sources DOAJ
author Nhan Thi Ho
Fan Li
Shuang Wang
Louise Kuhn
spellingShingle Nhan Thi Ho
Fan Li
Shuang Wang
Louise Kuhn
metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models
BMC Bioinformatics
Microbiome
Relative abundance
GAMLSS
Zero-inflated beta
Meta-analysis
Random effect
author_facet Nhan Thi Ho
Fan Li
Shuang Wang
Louise Kuhn
author_sort Nhan Thi Ho
title metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models
title_short metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models
title_full metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models
title_fullStr metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models
title_full_unstemmed metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models
title_sort metamicrobiomer: an r package for analysis of microbiome relative abundance data using zero-inflated beta gamlss and meta-analysis across studies using random effects models
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2019-04-01
description Abstract Background The rapid growth of high-throughput sequencing-based microbiome profiling has yielded tremendous insights into human health and physiology. Data generated from high-throughput sequencing of 16S rRNA gene amplicons are often preprocessed into composition or relative abundance. However, reproducibility has been lacking due to the myriad of different experimental and computational approaches taken in these studies. Microbiome studies may report varying results on the same topic, therefore, meta-analyses examining different microbiome studies to provide consistent and robust results are important. So far, there is still a lack of implemented methods to properly examine differential relative abundances of microbial taxonomies and to perform meta-analysis examining the heterogeneity and overall effects across microbiome studies. Results We developed an R package ‘metamicrobiomeR’ that applies Generalized Additive Models for Location, Scale and Shape (GAMLSS) with a zero-inflated beta (BEZI) family (GAMLSS-BEZI) for analysis of microbiome relative abundance datasets. Both simulation studies and application to real microbiome data demonstrate that GAMLSS-BEZI well performs in testing differential relative abundances of microbial taxonomies. Importantly, the estimates from GAMLSS-BEZI are log (odds ratio) of relative abundances between comparison groups and thus are analogous between microbiome studies. As such, we also apply random effects meta-analysis models to pool estimates and their standard errors across microbiome studies. We demonstrate the meta-analysis examples and highlight the utility of our package on four studies comparing gut microbiomes between male and female infants in the first six months of life. Conclusions GAMLSS-BEZI allows proper examination of microbiome relative abundance data. Random effects meta-analysis models can be directly applied to pool comparable estimates and their standard errors to evaluate the overall effects and heterogeneity across microbiome studies. The examples and workflow using our ‘metamicrobiomeR’ package are reproducible and applicable for the analyses and meta-analyses of other microbiome studies.
topic Microbiome
Relative abundance
GAMLSS
Zero-inflated beta
Meta-analysis
Random effect
url http://link.springer.com/article/10.1186/s12859-019-2744-2
work_keys_str_mv AT nhanthiho metamicrobiomeranrpackageforanalysisofmicrobiomerelativeabundancedatausingzeroinflatedbetagamlssandmetaanalysisacrossstudiesusingrandomeffectsmodels
AT fanli metamicrobiomeranrpackageforanalysisofmicrobiomerelativeabundancedatausingzeroinflatedbetagamlssandmetaanalysisacrossstudiesusingrandomeffectsmodels
AT shuangwang metamicrobiomeranrpackageforanalysisofmicrobiomerelativeabundancedatausingzeroinflatedbetagamlssandmetaanalysisacrossstudiesusingrandomeffectsmodels
AT louisekuhn metamicrobiomeranrpackageforanalysisofmicrobiomerelativeabundancedatausingzeroinflatedbetagamlssandmetaanalysisacrossstudiesusingrandomeffectsmodels
_version_ 1724858481170710528