metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models
Abstract Background The rapid growth of high-throughput sequencing-based microbiome profiling has yielded tremendous insights into human health and physiology. Data generated from high-throughput sequencing of 16S rRNA gene amplicons are often preprocessed into composition or relative abundance. How...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2019-04-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12859-019-2744-2 |
id |
doaj-502ed0e4c13449d6b17e54c810084f48 |
---|---|
record_format |
Article |
spelling |
doaj-502ed0e4c13449d6b17e54c810084f482020-11-25T02:23:37ZengBMCBMC Bioinformatics1471-21052019-04-0120111510.1186/s12859-019-2744-2metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects modelsNhan Thi Ho0Fan Li1Shuang Wang2Louise Kuhn3Gertrude H. Sergievsky Center, Columbia UniversityDepartment of Pediatrics, University of CaliforniaDepartment of Biostatistics, Mailman School of Public Health, Columbia UniversityGertrude H. Sergievsky Center, Columbia UniversityAbstract Background The rapid growth of high-throughput sequencing-based microbiome profiling has yielded tremendous insights into human health and physiology. Data generated from high-throughput sequencing of 16S rRNA gene amplicons are often preprocessed into composition or relative abundance. However, reproducibility has been lacking due to the myriad of different experimental and computational approaches taken in these studies. Microbiome studies may report varying results on the same topic, therefore, meta-analyses examining different microbiome studies to provide consistent and robust results are important. So far, there is still a lack of implemented methods to properly examine differential relative abundances of microbial taxonomies and to perform meta-analysis examining the heterogeneity and overall effects across microbiome studies. Results We developed an R package ‘metamicrobiomeR’ that applies Generalized Additive Models for Location, Scale and Shape (GAMLSS) with a zero-inflated beta (BEZI) family (GAMLSS-BEZI) for analysis of microbiome relative abundance datasets. Both simulation studies and application to real microbiome data demonstrate that GAMLSS-BEZI well performs in testing differential relative abundances of microbial taxonomies. Importantly, the estimates from GAMLSS-BEZI are log (odds ratio) of relative abundances between comparison groups and thus are analogous between microbiome studies. As such, we also apply random effects meta-analysis models to pool estimates and their standard errors across microbiome studies. We demonstrate the meta-analysis examples and highlight the utility of our package on four studies comparing gut microbiomes between male and female infants in the first six months of life. Conclusions GAMLSS-BEZI allows proper examination of microbiome relative abundance data. Random effects meta-analysis models can be directly applied to pool comparable estimates and their standard errors to evaluate the overall effects and heterogeneity across microbiome studies. The examples and workflow using our ‘metamicrobiomeR’ package are reproducible and applicable for the analyses and meta-analyses of other microbiome studies.http://link.springer.com/article/10.1186/s12859-019-2744-2MicrobiomeRelative abundanceGAMLSSZero-inflated betaMeta-analysisRandom effect |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Nhan Thi Ho Fan Li Shuang Wang Louise Kuhn |
spellingShingle |
Nhan Thi Ho Fan Li Shuang Wang Louise Kuhn metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models BMC Bioinformatics Microbiome Relative abundance GAMLSS Zero-inflated beta Meta-analysis Random effect |
author_facet |
Nhan Thi Ho Fan Li Shuang Wang Louise Kuhn |
author_sort |
Nhan Thi Ho |
title |
metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models |
title_short |
metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models |
title_full |
metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models |
title_fullStr |
metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models |
title_full_unstemmed |
metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models |
title_sort |
metamicrobiomer: an r package for analysis of microbiome relative abundance data using zero-inflated beta gamlss and meta-analysis across studies using random effects models |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2019-04-01 |
description |
Abstract Background The rapid growth of high-throughput sequencing-based microbiome profiling has yielded tremendous insights into human health and physiology. Data generated from high-throughput sequencing of 16S rRNA gene amplicons are often preprocessed into composition or relative abundance. However, reproducibility has been lacking due to the myriad of different experimental and computational approaches taken in these studies. Microbiome studies may report varying results on the same topic, therefore, meta-analyses examining different microbiome studies to provide consistent and robust results are important. So far, there is still a lack of implemented methods to properly examine differential relative abundances of microbial taxonomies and to perform meta-analysis examining the heterogeneity and overall effects across microbiome studies. Results We developed an R package ‘metamicrobiomeR’ that applies Generalized Additive Models for Location, Scale and Shape (GAMLSS) with a zero-inflated beta (BEZI) family (GAMLSS-BEZI) for analysis of microbiome relative abundance datasets. Both simulation studies and application to real microbiome data demonstrate that GAMLSS-BEZI well performs in testing differential relative abundances of microbial taxonomies. Importantly, the estimates from GAMLSS-BEZI are log (odds ratio) of relative abundances between comparison groups and thus are analogous between microbiome studies. As such, we also apply random effects meta-analysis models to pool estimates and their standard errors across microbiome studies. We demonstrate the meta-analysis examples and highlight the utility of our package on four studies comparing gut microbiomes between male and female infants in the first six months of life. Conclusions GAMLSS-BEZI allows proper examination of microbiome relative abundance data. Random effects meta-analysis models can be directly applied to pool comparable estimates and their standard errors to evaluate the overall effects and heterogeneity across microbiome studies. The examples and workflow using our ‘metamicrobiomeR’ package are reproducible and applicable for the analyses and meta-analyses of other microbiome studies. |
topic |
Microbiome Relative abundance GAMLSS Zero-inflated beta Meta-analysis Random effect |
url |
http://link.springer.com/article/10.1186/s12859-019-2744-2 |
work_keys_str_mv |
AT nhanthiho metamicrobiomeranrpackageforanalysisofmicrobiomerelativeabundancedatausingzeroinflatedbetagamlssandmetaanalysisacrossstudiesusingrandomeffectsmodels AT fanli metamicrobiomeranrpackageforanalysisofmicrobiomerelativeabundancedatausingzeroinflatedbetagamlssandmetaanalysisacrossstudiesusingrandomeffectsmodels AT shuangwang metamicrobiomeranrpackageforanalysisofmicrobiomerelativeabundancedatausingzeroinflatedbetagamlssandmetaanalysisacrossstudiesusingrandomeffectsmodels AT louisekuhn metamicrobiomeranrpackageforanalysisofmicrobiomerelativeabundancedatausingzeroinflatedbetagamlssandmetaanalysisacrossstudiesusingrandomeffectsmodels |
_version_ |
1724858481170710528 |