Powerful p-value combination methods to detect incomplete association

Abstract Meta-analyses increase statistical power by combining statistics from multiple studies. Meta-analysis methods have mostly been evaluated under the condition that all the data in each study have an association with the given phenotype. However, specific experimental conditions in each study...

Full description

Bibliographic Details
Main Authors: Sora Yoon, Bukyung Baik, Taesung Park, Dougu Nam
Format: Article
Language:English
Published: Nature Publishing Group 2021-03-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-86465-y
id doaj-855cb6e277fa4760b73d3ae5f6fd7226
record_format Article
spelling doaj-855cb6e277fa4760b73d3ae5f6fd72262021-03-28T11:31:57ZengNature Publishing GroupScientific Reports2045-23222021-03-0111111110.1038/s41598-021-86465-yPowerful p-value combination methods to detect incomplete associationSora Yoon0Bukyung Baik1Taesung Park2Dougu Nam3Department of Biological Sciences, Ulsan National Institute of Science and TechnologyDepartment of Biological Sciences, Ulsan National Institute of Science and TechnologyDepartment of Statistics, Seoul National UniversityDepartment of Biological Sciences, Ulsan National Institute of Science and TechnologyAbstract Meta-analyses increase statistical power by combining statistics from multiple studies. Meta-analysis methods have mostly been evaluated under the condition that all the data in each study have an association with the given phenotype. However, specific experimental conditions in each study or genetic heterogeneity can result in “unassociated statistics” that are derived from the null distribution. Here, we show that power of conventional meta-analysis methods rapidly decreases as an increasing number of unassociated statistics are included, whereas the classical Fisher’s method and its weighted variant (wFisher) exhibit relatively high power that is robust to addition of unassociated statistics. We also propose another robust method based on joint distribution of ordered p-values (ordmeta). Simulation analyses for t-test, RNA-seq, and microarray data demonstrated that wFisher and ordmeta, when only a small number of studies have an association, outperformed existing meta-analysis methods. We performed meta-analyses of nine microarray datasets (prostate cancer) and four association summary datasets (body mass index), where our methods exhibited high biological relevance and were able to detect genes that the-state-of-the-art methods missed. The metapro R package that implements the proposed methods is available from both CRAN and GitHub ( http://github.com/unistbig/metapro ).https://doi.org/10.1038/s41598-021-86465-y
collection DOAJ
language English
format Article
sources DOAJ
author Sora Yoon
Bukyung Baik
Taesung Park
Dougu Nam
spellingShingle Sora Yoon
Bukyung Baik
Taesung Park
Dougu Nam
Powerful p-value combination methods to detect incomplete association
Scientific Reports
author_facet Sora Yoon
Bukyung Baik
Taesung Park
Dougu Nam
author_sort Sora Yoon
title Powerful p-value combination methods to detect incomplete association
title_short Powerful p-value combination methods to detect incomplete association
title_full Powerful p-value combination methods to detect incomplete association
title_fullStr Powerful p-value combination methods to detect incomplete association
title_full_unstemmed Powerful p-value combination methods to detect incomplete association
title_sort powerful p-value combination methods to detect incomplete association
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2021-03-01
description Abstract Meta-analyses increase statistical power by combining statistics from multiple studies. Meta-analysis methods have mostly been evaluated under the condition that all the data in each study have an association with the given phenotype. However, specific experimental conditions in each study or genetic heterogeneity can result in “unassociated statistics” that are derived from the null distribution. Here, we show that power of conventional meta-analysis methods rapidly decreases as an increasing number of unassociated statistics are included, whereas the classical Fisher’s method and its weighted variant (wFisher) exhibit relatively high power that is robust to addition of unassociated statistics. We also propose another robust method based on joint distribution of ordered p-values (ordmeta). Simulation analyses for t-test, RNA-seq, and microarray data demonstrated that wFisher and ordmeta, when only a small number of studies have an association, outperformed existing meta-analysis methods. We performed meta-analyses of nine microarray datasets (prostate cancer) and four association summary datasets (body mass index), where our methods exhibited high biological relevance and were able to detect genes that the-state-of-the-art methods missed. The metapro R package that implements the proposed methods is available from both CRAN and GitHub ( http://github.com/unistbig/metapro ).
url https://doi.org/10.1038/s41598-021-86465-y
work_keys_str_mv AT sorayoon powerfulpvaluecombinationmethodstodetectincompleteassociation
AT bukyungbaik powerfulpvaluecombinationmethodstodetectincompleteassociation
AT taesungpark powerfulpvaluecombinationmethodstodetectincompleteassociation
AT dougunam powerfulpvaluecombinationmethodstodetectincompleteassociation
_version_ 1724199869444259840