Powerful p-value combination methods to detect incomplete association
Abstract Meta-analyses increase statistical power by combining statistics from multiple studies. Meta-analysis methods have mostly been evaluated under the condition that all the data in each study have an association with the given phenotype. However, specific experimental conditions in each study...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Publishing Group
2021-03-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-021-86465-y |
id |
doaj-855cb6e277fa4760b73d3ae5f6fd7226 |
---|---|
record_format |
Article |
spelling |
doaj-855cb6e277fa4760b73d3ae5f6fd72262021-03-28T11:31:57ZengNature Publishing GroupScientific Reports2045-23222021-03-0111111110.1038/s41598-021-86465-yPowerful p-value combination methods to detect incomplete associationSora Yoon0Bukyung Baik1Taesung Park2Dougu Nam3Department of Biological Sciences, Ulsan National Institute of Science and TechnologyDepartment of Biological Sciences, Ulsan National Institute of Science and TechnologyDepartment of Statistics, Seoul National UniversityDepartment of Biological Sciences, Ulsan National Institute of Science and TechnologyAbstract Meta-analyses increase statistical power by combining statistics from multiple studies. Meta-analysis methods have mostly been evaluated under the condition that all the data in each study have an association with the given phenotype. However, specific experimental conditions in each study or genetic heterogeneity can result in “unassociated statistics” that are derived from the null distribution. Here, we show that power of conventional meta-analysis methods rapidly decreases as an increasing number of unassociated statistics are included, whereas the classical Fisher’s method and its weighted variant (wFisher) exhibit relatively high power that is robust to addition of unassociated statistics. We also propose another robust method based on joint distribution of ordered p-values (ordmeta). Simulation analyses for t-test, RNA-seq, and microarray data demonstrated that wFisher and ordmeta, when only a small number of studies have an association, outperformed existing meta-analysis methods. We performed meta-analyses of nine microarray datasets (prostate cancer) and four association summary datasets (body mass index), where our methods exhibited high biological relevance and were able to detect genes that the-state-of-the-art methods missed. The metapro R package that implements the proposed methods is available from both CRAN and GitHub ( http://github.com/unistbig/metapro ).https://doi.org/10.1038/s41598-021-86465-y |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Sora Yoon Bukyung Baik Taesung Park Dougu Nam |
spellingShingle |
Sora Yoon Bukyung Baik Taesung Park Dougu Nam Powerful p-value combination methods to detect incomplete association Scientific Reports |
author_facet |
Sora Yoon Bukyung Baik Taesung Park Dougu Nam |
author_sort |
Sora Yoon |
title |
Powerful p-value combination methods to detect incomplete association |
title_short |
Powerful p-value combination methods to detect incomplete association |
title_full |
Powerful p-value combination methods to detect incomplete association |
title_fullStr |
Powerful p-value combination methods to detect incomplete association |
title_full_unstemmed |
Powerful p-value combination methods to detect incomplete association |
title_sort |
powerful p-value combination methods to detect incomplete association |
publisher |
Nature Publishing Group |
series |
Scientific Reports |
issn |
2045-2322 |
publishDate |
2021-03-01 |
description |
Abstract Meta-analyses increase statistical power by combining statistics from multiple studies. Meta-analysis methods have mostly been evaluated under the condition that all the data in each study have an association with the given phenotype. However, specific experimental conditions in each study or genetic heterogeneity can result in “unassociated statistics” that are derived from the null distribution. Here, we show that power of conventional meta-analysis methods rapidly decreases as an increasing number of unassociated statistics are included, whereas the classical Fisher’s method and its weighted variant (wFisher) exhibit relatively high power that is robust to addition of unassociated statistics. We also propose another robust method based on joint distribution of ordered p-values (ordmeta). Simulation analyses for t-test, RNA-seq, and microarray data demonstrated that wFisher and ordmeta, when only a small number of studies have an association, outperformed existing meta-analysis methods. We performed meta-analyses of nine microarray datasets (prostate cancer) and four association summary datasets (body mass index), where our methods exhibited high biological relevance and were able to detect genes that the-state-of-the-art methods missed. The metapro R package that implements the proposed methods is available from both CRAN and GitHub ( http://github.com/unistbig/metapro ). |
url |
https://doi.org/10.1038/s41598-021-86465-y |
work_keys_str_mv |
AT sorayoon powerfulpvaluecombinationmethodstodetectincompleteassociation AT bukyungbaik powerfulpvaluecombinationmethodstodetectincompleteassociation AT taesungpark powerfulpvaluecombinationmethodstodetectincompleteassociation AT dougunam powerfulpvaluecombinationmethodstodetectincompleteassociation |
_version_ |
1724199869444259840 |