Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved]
Background: In a recent paper, a novel W-test for pairwise epistasis testing was proposed that appeared, in computer simulations, to have higher power than competing alternatives. Application to genome-wide bipolar data detected significant epistasis between SNPs in genes of relevant biological func...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wellcome
2017-07-01
|
Series: | Wellcome Open Research |
Subjects: | |
Online Access: | https://wellcomeopenresearch.org/articles/2-54/v1 |
id |
doaj-710f8dccce39475eb0aa4981509575d1 |
---|---|
record_format |
Article |
spelling |
doaj-710f8dccce39475eb0aa4981509575d12020-11-24T22:52:25ZengWellcomeWellcome Open Research2398-502X2017-07-01210.12688/wellcomeopenres.11926.112891Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved]Richard Howey0Heather J. Cordell1Institute of Genetic Medicine, Newcastle University, Newcastle upon Tyne, NE1 3BZ, UKInstitute of Genetic Medicine, Newcastle University, Newcastle upon Tyne, NE1 3BZ, UKBackground: In a recent paper, a novel W-test for pairwise epistasis testing was proposed that appeared, in computer simulations, to have higher power than competing alternatives. Application to genome-wide bipolar data detected significant epistasis between SNPs in genes of relevant biological function. Network analysis indicated that the implicated genes formed two separate interaction networks, each containing genes highly related to autism and neurodegenerative disorders. Methods: Here we investigate further the properties and performance of the W-test via theoretical evaluation, computer simulations and application to real data. Results: We demonstrate that, for common variants, the W-test is closely related to several existing tests of association allowing for interaction, including logistic regression on 8 degrees of freedom, although logistic regression can show inflated type I error for low minor allele frequencies, whereas the W-test shows good/conservative type I error control. Although in some situations the W-test can show higher power, logistic regression is not limited to tests on 8 degrees of freedom but can instead be taylored to impose greater structure on the assumed alternative hypothesis, offering a power advantage when the imposed structure matches the true structure. Conclusions: The W-test is a potentially useful method for testing for association - without necessarily implying interaction - between genetic variants disease, particularly when one or more of the genetic variants are rare. For common variants, the advantages of the W-test are less clear, and, indeed, there are situations where existing methods perform better. In our investigations, we further uncover a number of problems with the practical implementation and application of the W-test (to bipolar disorder) previously described, apparently due to inadequate use of standard data quality-control procedures. This observation leads us to urge caution in interpretation of the previously-presented results, most of which we consider are highly likely to be artefacts.https://wellcomeopenresearch.org/articles/2-54/v1BioinformaticsGenomicsStatistical Methodologies & Health Informatics |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Richard Howey Heather J. Cordell |
spellingShingle |
Richard Howey Heather J. Cordell Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved] Wellcome Open Research Bioinformatics Genomics Statistical Methodologies & Health Informatics |
author_facet |
Richard Howey Heather J. Cordell |
author_sort |
Richard Howey |
title |
Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved] |
title_short |
Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved] |
title_full |
Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved] |
title_fullStr |
Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved] |
title_full_unstemmed |
Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved] |
title_sort |
further investigations of the w-test for pairwise epistasis testing [version 1; referees: 2 approved] |
publisher |
Wellcome |
series |
Wellcome Open Research |
issn |
2398-502X |
publishDate |
2017-07-01 |
description |
Background: In a recent paper, a novel W-test for pairwise epistasis testing was proposed that appeared, in computer simulations, to have higher power than competing alternatives. Application to genome-wide bipolar data detected significant epistasis between SNPs in genes of relevant biological function. Network analysis indicated that the implicated genes formed two separate interaction networks, each containing genes highly related to autism and neurodegenerative disorders. Methods: Here we investigate further the properties and performance of the W-test via theoretical evaluation, computer simulations and application to real data. Results: We demonstrate that, for common variants, the W-test is closely related to several existing tests of association allowing for interaction, including logistic regression on 8 degrees of freedom, although logistic regression can show inflated type I error for low minor allele frequencies, whereas the W-test shows good/conservative type I error control. Although in some situations the W-test can show higher power, logistic regression is not limited to tests on 8 degrees of freedom but can instead be taylored to impose greater structure on the assumed alternative hypothesis, offering a power advantage when the imposed structure matches the true structure. Conclusions: The W-test is a potentially useful method for testing for association - without necessarily implying interaction - between genetic variants disease, particularly when one or more of the genetic variants are rare. For common variants, the advantages of the W-test are less clear, and, indeed, there are situations where existing methods perform better. In our investigations, we further uncover a number of problems with the practical implementation and application of the W-test (to bipolar disorder) previously described, apparently due to inadequate use of standard data quality-control procedures. This observation leads us to urge caution in interpretation of the previously-presented results, most of which we consider are highly likely to be artefacts. |
topic |
Bioinformatics Genomics Statistical Methodologies & Health Informatics |
url |
https://wellcomeopenresearch.org/articles/2-54/v1 |
work_keys_str_mv |
AT richardhowey furtherinvestigationsofthewtestforpairwiseepistasistestingversion1referees2approved AT heatherjcordell furtherinvestigationsofthewtestforpairwiseepistasistestingversion1referees2approved |
_version_ |
1725666337874771968 |