Comparative analysis of methods for detecting interacting loci

<p>Abstract</p> <p>Background</p> <p>Interactions among genetic loci are believed to play an important role in disease risk. While many methods have been proposed for detecting such interactions, their relative performance remains largely unclear, mainly because differe...

Full description

Bibliographic Details
Main Authors: Yuan Xiguo, Raghuram Jayaram, Guy Richard T, Miller David J, Langefeld Carl D, Yu Guoqiang, Chen Li, Herrington David M, Wang Yue
Format: Article
Language:English
Published: BMC 2011-07-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/12/344
id doaj-4fe98a96e72a4ffaacd5b8b8a8983d19
record_format Article
spelling doaj-4fe98a96e72a4ffaacd5b8b8a8983d192020-11-24T22:58:25ZengBMCBMC Genomics1471-21642011-07-0112134410.1186/1471-2164-12-344Comparative analysis of methods for detecting interacting lociYuan XiguoRaghuram JayaramGuy Richard TMiller David JLangefeld Carl DYu GuoqiangChen LiHerrington David MWang Yue<p>Abstract</p> <p>Background</p> <p>Interactions among genetic loci are believed to play an important role in disease risk. While many methods have been proposed for detecting such interactions, their relative performance remains largely unclear, mainly because different data sources, detection performance criteria, and experimental protocols were used in the papers introducing these methods and in subsequent studies. Moreover, there have been very few studies strictly focused on comparison of existing methods. Given the importance of detecting gene-gene and gene-environment interactions, a rigorous, comprehensive comparison of performance and limitations of available interaction detection methods is warranted.</p> <p>Results</p> <p>We report a comparison of eight representative methods, of which seven were specifically designed to detect interactions among single nucleotide polymorphisms (SNPs), with the last a popular main-effect testing method used as a baseline for performance evaluation. The selected methods, multifactor dimensionality reduction (MDR), full interaction model (FIM), information gain (IG), Bayesian epistasis association mapping (BEAM), SNP harvester (SH), maximum entropy conditional probability modeling (MECPM), logistic regression with an interaction term (LRIT), and logistic regression (LR) were compared on a large number of simulated data sets, each, consistent with complex disease models, embedding <it>multiple </it>sets of interacting SNPs, under different interaction models. The assessment criteria included several relevant detection power measures, family-wise type I error rate, and computational complexity. There are several important results from this study. First, while some SNPs in interactions with strong effects are successfully detected, most of the methods miss many interacting SNPs at an acceptable rate of false positives. In this study, the best-performing method was MECPM. Second, the statistical significance assessment criteria, used by some of the methods to control the type I error rate, are quite conservative, thereby limiting their power and making it difficult to fairly compare them. Third, as expected, power varies for different models and as a function of penetrance, minor allele frequency, linkage disequilibrium and marginal effects. Fourth, the analytical relationships between power and these factors are derived, aiding in the interpretation of the study results. Fifth, for these methods the magnitude of the main effect influences the power of the tests. Sixth, most methods can detect some ground-truth SNPs but have modest power to detect the whole set of interacting SNPs.</p> <p>Conclusion</p> <p>This comparison study provides new insights into the strengths and limitations of current methods for detecting interacting loci. This study, along with freely available simulation tools we provide, should help support development of improved methods. The simulation tools are available at: <url>http://code.google.com/p/simulation-tool-bmc-ms9169818735220977/downloads/list</url>.</p> http://www.biomedcentral.com/1471-2164/12/344
collection DOAJ
language English
format Article
sources DOAJ
author Yuan Xiguo
Raghuram Jayaram
Guy Richard T
Miller David J
Langefeld Carl D
Yu Guoqiang
Chen Li
Herrington David M
Wang Yue
spellingShingle Yuan Xiguo
Raghuram Jayaram
Guy Richard T
Miller David J
Langefeld Carl D
Yu Guoqiang
Chen Li
Herrington David M
Wang Yue
Comparative analysis of methods for detecting interacting loci
BMC Genomics
author_facet Yuan Xiguo
Raghuram Jayaram
Guy Richard T
Miller David J
Langefeld Carl D
Yu Guoqiang
Chen Li
Herrington David M
Wang Yue
author_sort Yuan Xiguo
title Comparative analysis of methods for detecting interacting loci
title_short Comparative analysis of methods for detecting interacting loci
title_full Comparative analysis of methods for detecting interacting loci
title_fullStr Comparative analysis of methods for detecting interacting loci
title_full_unstemmed Comparative analysis of methods for detecting interacting loci
title_sort comparative analysis of methods for detecting interacting loci
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2011-07-01
description <p>Abstract</p> <p>Background</p> <p>Interactions among genetic loci are believed to play an important role in disease risk. While many methods have been proposed for detecting such interactions, their relative performance remains largely unclear, mainly because different data sources, detection performance criteria, and experimental protocols were used in the papers introducing these methods and in subsequent studies. Moreover, there have been very few studies strictly focused on comparison of existing methods. Given the importance of detecting gene-gene and gene-environment interactions, a rigorous, comprehensive comparison of performance and limitations of available interaction detection methods is warranted.</p> <p>Results</p> <p>We report a comparison of eight representative methods, of which seven were specifically designed to detect interactions among single nucleotide polymorphisms (SNPs), with the last a popular main-effect testing method used as a baseline for performance evaluation. The selected methods, multifactor dimensionality reduction (MDR), full interaction model (FIM), information gain (IG), Bayesian epistasis association mapping (BEAM), SNP harvester (SH), maximum entropy conditional probability modeling (MECPM), logistic regression with an interaction term (LRIT), and logistic regression (LR) were compared on a large number of simulated data sets, each, consistent with complex disease models, embedding <it>multiple </it>sets of interacting SNPs, under different interaction models. The assessment criteria included several relevant detection power measures, family-wise type I error rate, and computational complexity. There are several important results from this study. First, while some SNPs in interactions with strong effects are successfully detected, most of the methods miss many interacting SNPs at an acceptable rate of false positives. In this study, the best-performing method was MECPM. Second, the statistical significance assessment criteria, used by some of the methods to control the type I error rate, are quite conservative, thereby limiting their power and making it difficult to fairly compare them. Third, as expected, power varies for different models and as a function of penetrance, minor allele frequency, linkage disequilibrium and marginal effects. Fourth, the analytical relationships between power and these factors are derived, aiding in the interpretation of the study results. Fifth, for these methods the magnitude of the main effect influences the power of the tests. Sixth, most methods can detect some ground-truth SNPs but have modest power to detect the whole set of interacting SNPs.</p> <p>Conclusion</p> <p>This comparison study provides new insights into the strengths and limitations of current methods for detecting interacting loci. This study, along with freely available simulation tools we provide, should help support development of improved methods. The simulation tools are available at: <url>http://code.google.com/p/simulation-tool-bmc-ms9169818735220977/downloads/list</url>.</p>
url http://www.biomedcentral.com/1471-2164/12/344
work_keys_str_mv AT yuanxiguo comparativeanalysisofmethodsfordetectinginteractingloci
AT raghuramjayaram comparativeanalysisofmethodsfordetectinginteractingloci
AT guyrichardt comparativeanalysisofmethodsfordetectinginteractingloci
AT millerdavidj comparativeanalysisofmethodsfordetectinginteractingloci
AT langefeldcarld comparativeanalysisofmethodsfordetectinginteractingloci
AT yuguoqiang comparativeanalysisofmethodsfordetectinginteractingloci
AT chenli comparativeanalysisofmethodsfordetectinginteractingloci
AT herringtondavidm comparativeanalysisofmethodsfordetectinginteractingloci
AT wangyue comparativeanalysisofmethodsfordetectinginteractingloci
_version_ 1725647185661394944