Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.

Given the expanding availability of scientific data and tools to analyze them, combining different assessments of the same piece of information has become increasingly important for social, biological, and even physical sciences. This task demands, to begin with, a method-independent standard, such...

Full description

Bibliographic Details
Main Authors: Gelio Alves, Yi-Kuo Yu
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2011-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3166143?pdf=render
id doaj-3474a0020ea34c199caca8bdc5a7b233
record_format Article
spelling doaj-3474a0020ea34c199caca8bdc5a7b2332020-11-25T01:24:02ZengPublic Library of Science (PLoS)PLoS ONE1932-62032011-01-0168e2264710.1371/journal.pone.0022647Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.Gelio AlvesYi-Kuo YuGiven the expanding availability of scientific data and tools to analyze them, combining different assessments of the same piece of information has become increasingly important for social, biological, and even physical sciences. This task demands, to begin with, a method-independent standard, such as the P-value, that can be used to assess the reliability of a piece of information. Good's formula and Fisher's method combine independent P-values with respectively unequal and equal weights. Both approaches may be regarded as limiting instances of a general case of combining P-values from m groups; P-values within each group are weighted equally, while weight varies by group. When some of the weights become nearly degenerate, as cautioned by Good, numeric instability occurs in computation of the combined P-values. We deal explicitly with this difficulty by deriving a controlled expansion, in powers of differences in inverse weights, that provides both accurate statistics and stable numerics. We illustrate the utility of this systematic approach with a few examples. In addition, we also provide here an alternative derivation for the probability distribution function of the general case and show how the analytic formula obtained reduces to both Good's and Fisher's methods as special cases. A C++ program, which computes the combined P-values with equal numerical stability regardless of whether weights are (nearly) degenerate or not, is available for download at our group website http://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads/CoinedPValues.html.http://europepmc.org/articles/PMC3166143?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Gelio Alves
Yi-Kuo Yu
spellingShingle Gelio Alves
Yi-Kuo Yu
Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.
PLoS ONE
author_facet Gelio Alves
Yi-Kuo Yu
author_sort Gelio Alves
title Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.
title_short Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.
title_full Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.
title_fullStr Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.
title_full_unstemmed Combining independent, weighted P-values: achieving computational stability by a systematic expansion with controllable accuracy.
title_sort combining independent, weighted p-values: achieving computational stability by a systematic expansion with controllable accuracy.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2011-01-01
description Given the expanding availability of scientific data and tools to analyze them, combining different assessments of the same piece of information has become increasingly important for social, biological, and even physical sciences. This task demands, to begin with, a method-independent standard, such as the P-value, that can be used to assess the reliability of a piece of information. Good's formula and Fisher's method combine independent P-values with respectively unequal and equal weights. Both approaches may be regarded as limiting instances of a general case of combining P-values from m groups; P-values within each group are weighted equally, while weight varies by group. When some of the weights become nearly degenerate, as cautioned by Good, numeric instability occurs in computation of the combined P-values. We deal explicitly with this difficulty by deriving a controlled expansion, in powers of differences in inverse weights, that provides both accurate statistics and stable numerics. We illustrate the utility of this systematic approach with a few examples. In addition, we also provide here an alternative derivation for the probability distribution function of the general case and show how the analytic formula obtained reduces to both Good's and Fisher's methods as special cases. A C++ program, which computes the combined P-values with equal numerical stability regardless of whether weights are (nearly) degenerate or not, is available for download at our group website http://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads/CoinedPValues.html.
url http://europepmc.org/articles/PMC3166143?pdf=render
work_keys_str_mv AT gelioalves combiningindependentweightedpvaluesachievingcomputationalstabilitybyasystematicexpansionwithcontrollableaccuracy
AT yikuoyu combiningindependentweightedpvaluesachievingcomputationalstabilitybyasystematicexpansionwithcontrollableaccuracy
_version_ 1725119353833127936