A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals
Abstract Background In human microbiome studies, it is crucial to evaluate the association between microbial group (e.g., community or clade) composition and a host phenotype of interest. In response, a number of microbial group association tests have been proposed, which account for the unique feat...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2020-05-01
|
Series: | Microbiome |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s40168-020-00834-9 |
id |
doaj-41773f3893db46a3955763c8a26e1224 |
---|---|
record_format |
Article |
spelling |
doaj-41773f3893db46a3955763c8a26e12242020-11-25T03:23:28ZengBMCMicrobiome2049-26182020-05-018111610.1186/s40168-020-00834-9A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signalsHyunwook Koh0Ni Zhao1Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins UniversityDepartment of Biostatistics, Bloomberg School of Public Health, Johns Hopkins UniversityAbstract Background In human microbiome studies, it is crucial to evaluate the association between microbial group (e.g., community or clade) composition and a host phenotype of interest. In response, a number of microbial group association tests have been proposed, which account for the unique features of the microbiome data (e.g., high-dimensionality, compositionality, phylogenetic relationship). These tests generally fall in the class of aggregation tests which amplify the overall group association by combining all the underlying microbial association signals, and, therefore, they are powerful when many microbial species are associated with a given host phenotype (i.e., low sparsity). However, in practice, the microbial association signals can be highly sparse, and this is especially the situation where we have a difficulty to discover the microbial group association. Methods Here, we introduce a powerful microbial group association test for sparse microbial association signals, namely, microbiome higher criticism analysis (MiHC). MiHC is a data-driven omnibus test taken in a search space spanned by tailoring the higher criticism test to incorporate phylogenetic information and/or modulate sparsity levels and including the Simes test for excessively high sparsity levels. Therefore, MiHC robustly adapts to diverse phylogenetic relevance and sparsity levels. Results Our simulations show that MiHC maintains a high power at different phylogenetic relevance and sparsity levels with correct type I error controls. We also apply MiHC to four real microbiome datasets to test the association between respiratory tract microbiome and smoking status, the association between the infant’s gut microbiome and delivery mode, the association between the gut microbiome and type 1 diabetes status, and the association between the gut microbiome and human immunodeficiency virus status. Conclusions In practice, the true underlying association pattern on the extent of phylogenetic relevance and sparsity is usually unknown. Therefore, MiHC can be a useful analytic tool because of its high adaptivity to diverse phylogenetic relevance and sparsity levels. MiHC can be implemented in the R computing environment using our software package freely available at https://github.com/hk1785/MiHC .http://link.springer.com/article/10.1186/s40168-020-00834-9Microbiome association studiesMicrobial ecologyAdaptive association analysisHigher criticismSparse microbial associationsPhylogenetics |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Hyunwook Koh Ni Zhao |
spellingShingle |
Hyunwook Koh Ni Zhao A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals Microbiome Microbiome association studies Microbial ecology Adaptive association analysis Higher criticism Sparse microbial associations Phylogenetics |
author_facet |
Hyunwook Koh Ni Zhao |
author_sort |
Hyunwook Koh |
title |
A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals |
title_short |
A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals |
title_full |
A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals |
title_fullStr |
A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals |
title_full_unstemmed |
A powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals |
title_sort |
powerful microbial group association test based on the higher criticism analysis for sparse microbial association signals |
publisher |
BMC |
series |
Microbiome |
issn |
2049-2618 |
publishDate |
2020-05-01 |
description |
Abstract Background In human microbiome studies, it is crucial to evaluate the association between microbial group (e.g., community or clade) composition and a host phenotype of interest. In response, a number of microbial group association tests have been proposed, which account for the unique features of the microbiome data (e.g., high-dimensionality, compositionality, phylogenetic relationship). These tests generally fall in the class of aggregation tests which amplify the overall group association by combining all the underlying microbial association signals, and, therefore, they are powerful when many microbial species are associated with a given host phenotype (i.e., low sparsity). However, in practice, the microbial association signals can be highly sparse, and this is especially the situation where we have a difficulty to discover the microbial group association. Methods Here, we introduce a powerful microbial group association test for sparse microbial association signals, namely, microbiome higher criticism analysis (MiHC). MiHC is a data-driven omnibus test taken in a search space spanned by tailoring the higher criticism test to incorporate phylogenetic information and/or modulate sparsity levels and including the Simes test for excessively high sparsity levels. Therefore, MiHC robustly adapts to diverse phylogenetic relevance and sparsity levels. Results Our simulations show that MiHC maintains a high power at different phylogenetic relevance and sparsity levels with correct type I error controls. We also apply MiHC to four real microbiome datasets to test the association between respiratory tract microbiome and smoking status, the association between the infant’s gut microbiome and delivery mode, the association between the gut microbiome and type 1 diabetes status, and the association between the gut microbiome and human immunodeficiency virus status. Conclusions In practice, the true underlying association pattern on the extent of phylogenetic relevance and sparsity is usually unknown. Therefore, MiHC can be a useful analytic tool because of its high adaptivity to diverse phylogenetic relevance and sparsity levels. MiHC can be implemented in the R computing environment using our software package freely available at https://github.com/hk1785/MiHC . |
topic |
Microbiome association studies Microbial ecology Adaptive association analysis Higher criticism Sparse microbial associations Phylogenetics |
url |
http://link.springer.com/article/10.1186/s40168-020-00834-9 |
work_keys_str_mv |
AT hyunwookkoh apowerfulmicrobialgroupassociationtestbasedonthehighercriticismanalysisforsparsemicrobialassociationsignals AT nizhao apowerfulmicrobialgroupassociationtestbasedonthehighercriticismanalysisforsparsemicrobialassociationsignals AT hyunwookkoh powerfulmicrobialgroupassociationtestbasedonthehighercriticismanalysisforsparsemicrobialassociationsignals AT nizhao powerfulmicrobialgroupassociationtestbasedonthehighercriticismanalysisforsparsemicrobialassociationsignals |
_version_ |
1724606166860824576 |