A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals

Abstract Background In human microbiome studies, it is crucial to evaluate the association between microbial group (e.g., community or clade) composition and a host phenotype of interest. In response, a number of microbial group association tests have been proposed, which account for the unique feat...

Full description

Bibliographic Details
Main Authors: Hyunwook Koh, Ni Zhao
Format: Article
Language:English
Published: BMC 2020-05-01
Series:Microbiome
Subjects:
Online Access:http://link.springer.com/article/10.1186/s40168-020-00834-9
id doaj-41773f3893db46a3955763c8a26e1224
record_format Article
spelling doaj-41773f3893db46a3955763c8a26e12242020-11-25T03:23:28ZengBMCMicrobiome2049-26182020-05-018111610.1186/s40168-020-00834-9A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signalsHyunwook Koh0Ni Zhao1Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins UniversityDepartment of Biostatistics, Bloomberg School of Public Health, Johns Hopkins UniversityAbstract Background In human microbiome studies, it is crucial to evaluate the association between microbial group (e.g., community or clade) composition and a host phenotype of interest. In response, a number of microbial group association tests have been proposed, which account for the unique features of the microbiome data (e.g., high-dimensionality, compositionality, phylogenetic relationship). These tests generally fall in the class of aggregation tests which amplify the overall group association by combining all the underlying microbial association signals, and, therefore, they are powerful when many microbial species are associated with a given host phenotype (i.e., low sparsity). However, in practice, the microbial association signals can be highly sparse, and this is especially the situation where we have a difficulty to discover the microbial group association. Methods Here, we introduce a powerful microbial group association test for sparse microbial association signals, namely, microbiome higher criticism analysis (MiHC). MiHC is a data-driven omnibus test taken in a search space spanned by tailoring the higher criticism test to incorporate phylogenetic information and/or modulate sparsity levels and including the Simes test for excessively high sparsity levels. Therefore, MiHC robustly adapts to diverse phylogenetic relevance and sparsity levels. Results Our simulations show that MiHC maintains a high power at different phylogenetic relevance and sparsity levels with correct type I error controls. We also apply MiHC to four real microbiome datasets to test the association between respiratory tract microbiome and smoking status, the association between the infant’s gut microbiome and delivery mode, the association between the gut microbiome and type 1 diabetes status, and the association between the gut microbiome and human immunodeficiency virus status. Conclusions In practice, the true underlying association pattern on the extent of phylogenetic relevance and sparsity is usually unknown. Therefore, MiHC can be a useful analytic tool because of its high adaptivity to diverse phylogenetic relevance and sparsity levels. MiHC can be implemented in the R computing environment using our software package freely available at https://github.com/hk1785/MiHC .http://link.springer.com/article/10.1186/s40168-020-00834-9Microbiome association studiesMicrobial ecologyAdaptive association analysisHigher criticismSparse microbial associationsPhylogenetics
collection DOAJ
language English
format Article
sources DOAJ
author Hyunwook Koh
Ni Zhao
spellingShingle Hyunwook Koh
Ni Zhao
A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals
Microbiome
Microbiome association studies
Microbial ecology
Adaptive association analysis
Higher criticism
Sparse microbial associations
Phylogenetics
author_facet Hyunwook Koh
Ni Zhao
author_sort Hyunwook Koh
title A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals
title_short A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals
title_full A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals
title_fullStr A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals
title_full_unstemmed A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals
title_sort powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals
publisher BMC
series Microbiome
issn 2049-2618
publishDate 2020-05-01
description Abstract Background In human microbiome studies, it is crucial to evaluate the association between microbial group (e.g., community or clade) composition and a host phenotype of interest. In response, a number of microbial group association tests have been proposed, which account for the unique features of the microbiome data (e.g., high-dimensionality, compositionality, phylogenetic relationship). These tests generally fall in the class of aggregation tests which amplify the overall group association by combining all the underlying microbial association signals, and, therefore, they are powerful when many microbial species are associated with a given host phenotype (i.e., low sparsity). However, in practice, the microbial association signals can be highly sparse, and this is especially the situation where we have a difficulty to discover the microbial group association. Methods Here, we introduce a powerful microbial group association test for sparse microbial association signals, namely, microbiome higher criticism analysis (MiHC). MiHC is a data-driven omnibus test taken in a search space spanned by tailoring the higher criticism test to incorporate phylogenetic information and/or modulate sparsity levels and including the Simes test for excessively high sparsity levels. Therefore, MiHC robustly adapts to diverse phylogenetic relevance and sparsity levels. Results Our simulations show that MiHC maintains a high power at different phylogenetic relevance and sparsity levels with correct type I error controls. We also apply MiHC to four real microbiome datasets to test the association between respiratory tract microbiome and smoking status, the association between the infant’s gut microbiome and delivery mode, the association between the gut microbiome and type 1 diabetes status, and the association between the gut microbiome and human immunodeficiency virus status. Conclusions In practice, the true underlying association pattern on the extent of phylogenetic relevance and sparsity is usually unknown. Therefore, MiHC can be a useful analytic tool because of its high adaptivity to diverse phylogenetic relevance and sparsity levels. MiHC can be implemented in the R computing environment using our software package freely available at https://github.com/hk1785/MiHC .
topic Microbiome association studies
Microbial ecology
Adaptive association analysis
Higher criticism
Sparse microbial associations
Phylogenetics
url http://link.springer.com/article/10.1186/s40168-020-00834-9
work_keys_str_mv AT hyunwookkoh apowerfulmicrobialgroupassociationtestbasedonthehighercriticismanalysisforsparsemicrobialassociationsignals
AT nizhao apowerfulmicrobialgroupassociationtestbasedonthehighercriticismanalysisforsparsemicrobialassociationsignals
AT hyunwookkoh powerfulmicrobialgroupassociationtestbasedonthehighercriticismanalysisforsparsemicrobialassociationsignals
AT nizhao powerfulmicrobialgroupassociationtestbasedonthehighercriticismanalysisforsparsemicrobialassociationsignals
_version_ 1724606166860824576