Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved]

Background: In a recent paper, a novel W-test for pairwise epistasis testing was proposed that appeared, in computer simulations, to have higher power than competing alternatives. Application to genome-wide bipolar data detected significant epistasis between SNPs in genes of relevant biological func...

Full description

Bibliographic Details
Main Authors: Richard Howey, Heather J. Cordell
Format: Article
Language:English
Published: Wellcome 2017-07-01
Series:Wellcome Open Research
Subjects:
Online Access:https://wellcomeopenresearch.org/articles/2-54/v1
id doaj-710f8dccce39475eb0aa4981509575d1
record_format Article
spelling doaj-710f8dccce39475eb0aa4981509575d12020-11-24T22:52:25ZengWellcomeWellcome Open Research2398-502X2017-07-01210.12688/wellcomeopenres.11926.112891Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved]Richard Howey0Heather J. Cordell1Institute of Genetic Medicine, Newcastle University, Newcastle upon Tyne, NE1 3BZ, UKInstitute of Genetic Medicine, Newcastle University, Newcastle upon Tyne, NE1 3BZ, UKBackground: In a recent paper, a novel W-test for pairwise epistasis testing was proposed that appeared, in computer simulations, to have higher power than competing alternatives. Application to genome-wide bipolar data detected significant epistasis between SNPs in genes of relevant biological function. Network analysis indicated that the implicated genes formed two separate interaction networks, each containing genes highly related to autism and neurodegenerative disorders. Methods: Here we investigate further the properties and performance of the W-test via theoretical evaluation, computer simulations and application to real data. Results: We demonstrate that, for common variants, the W-test is closely related to several existing tests of association allowing for interaction, including logistic regression on 8 degrees of freedom, although logistic regression can show inflated type I error for low minor allele frequencies,  whereas the W-test shows good/conservative type I error control. Although in some situations the W-test can show higher power, logistic regression is not limited to tests on 8 degrees of freedom but can instead be taylored to impose greater structure on the assumed alternative hypothesis, offering a power advantage when the imposed structure matches the true structure. Conclusions: The W-test is a potentially useful method for testing for association - without necessarily implying interaction - between genetic variants disease, particularly when one or more of the genetic variants are rare. For common variants, the advantages of the W-test are less clear, and, indeed, there are situations where existing methods perform better. In our investigations, we further uncover a number of problems with the practical implementation and application of the W-test (to bipolar disorder) previously described, apparently due to inadequate use of standard data quality-control procedures. This observation leads us to urge caution in interpretation of the previously-presented results, most of which we consider are highly likely to be artefacts.https://wellcomeopenresearch.org/articles/2-54/v1BioinformaticsGenomicsStatistical Methodologies & Health Informatics
collection DOAJ
language English
format Article
sources DOAJ
author Richard Howey
Heather J. Cordell
spellingShingle Richard Howey
Heather J. Cordell
Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved]
Wellcome Open Research
Bioinformatics
Genomics
Statistical Methodologies & Health Informatics
author_facet Richard Howey
Heather J. Cordell
author_sort Richard Howey
title Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved]
title_short Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved]
title_full Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved]
title_fullStr Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved]
title_full_unstemmed Further investigations of the W-test for pairwise epistasis testing [version 1; referees: 2 approved]
title_sort further investigations of the w-test for pairwise epistasis testing [version 1; referees: 2 approved]
publisher Wellcome
series Wellcome Open Research
issn 2398-502X
publishDate 2017-07-01
description Background: In a recent paper, a novel W-test for pairwise epistasis testing was proposed that appeared, in computer simulations, to have higher power than competing alternatives. Application to genome-wide bipolar data detected significant epistasis between SNPs in genes of relevant biological function. Network analysis indicated that the implicated genes formed two separate interaction networks, each containing genes highly related to autism and neurodegenerative disorders. Methods: Here we investigate further the properties and performance of the W-test via theoretical evaluation, computer simulations and application to real data. Results: We demonstrate that, for common variants, the W-test is closely related to several existing tests of association allowing for interaction, including logistic regression on 8 degrees of freedom, although logistic regression can show inflated type I error for low minor allele frequencies,  whereas the W-test shows good/conservative type I error control. Although in some situations the W-test can show higher power, logistic regression is not limited to tests on 8 degrees of freedom but can instead be taylored to impose greater structure on the assumed alternative hypothesis, offering a power advantage when the imposed structure matches the true structure. Conclusions: The W-test is a potentially useful method for testing for association - without necessarily implying interaction - between genetic variants disease, particularly when one or more of the genetic variants are rare. For common variants, the advantages of the W-test are less clear, and, indeed, there are situations where existing methods perform better. In our investigations, we further uncover a number of problems with the practical implementation and application of the W-test (to bipolar disorder) previously described, apparently due to inadequate use of standard data quality-control procedures. This observation leads us to urge caution in interpretation of the previously-presented results, most of which we consider are highly likely to be artefacts.
topic Bioinformatics
Genomics
Statistical Methodologies & Health Informatics
url https://wellcomeopenresearch.org/articles/2-54/v1
work_keys_str_mv AT richardhowey furtherinvestigationsofthewtestforpairwiseepistasistestingversion1referees2approved
AT heatherjcordell furtherinvestigationsofthewtestforpairwiseepistasistestingversion1referees2approved
_version_ 1725666337874771968