Summary: | <p>Abstract</p> <p>Background</p> <p>Combining multiple independent tests, when all test the same hypothesis and in the same direction, has been the subject of several approaches. Besides the inappropriate (in this case) Bonferroni procedure, the Fisher's method has been widely used, in particular in population genetics. This last method has nevertheless been challenged by the SGM (symmetry around the geometric mean) and Stouffer's <it>Z</it>-transformed methods that are less sensitive to asymmetry and deviations from uniformity of the distribution of the partial <it>P</it>-values. Performances of these different procedures were never compared on proportional data such as those currently used in population genetics.</p> <p>Results</p> <p>We present new software that implements a more recent method, the generalised binomial procedure, which tests for the deviation of the observed proportion of <it>P</it>-values lying under a chosen threshold from the expected proportion of such <it>P</it>-values under the null hypothesis. The respective performances of all available procedures were evaluated using simulated data under the null hypothesis with standard <it>P</it>-values distribution (differentiation tests). All procedures more or less behaved consistently with ~5% significant tests at <it>α </it>= 0.05. Then, linkage disequilibrium tests with increasing signal strength (rate of clonal reproduction), known to generate highly non-standard <it>P</it>-value distributions are undertaken and finally real population genetics data are analysed. In these cases, all procedures appear, more or less equally, very conservative, though SGM seems slightly more conservative.</p> <p>Conclusion</p> <p>Based on our results and those discussed in the literature we conclude that the generalised binomial and Stouffer's <it>Z </it>procedures should be preferred and <it>Z </it>when the number of tests is very small. The more conservative SGM might still be appropriate for meta-analyses when a strong publication bias in favour of significant results is expected to inflate type 2 error.</p>
|