An entropy test for single-locus genetic association analysis

<p>Abstract</p> <p>Background</p> <p>The etiology of complex diseases is due to the combination of genetic and environmental factors, usually many of them, and each with a small effect. The identification of these small-effect contributing factors is still a demanding t...

Full description

Bibliographic Details
Main Authors: González-Pérez Antonio, Romo-Astorga Alejandro, Susillo-González Juan, Cordoba José, Matilla-García Mariano, Ruiz-Marín Manuel, Ruiz Agustín, Gayán Javier
Format: Article
Language:English
Published: BMC 2010-03-01
Series:BMC Genetics
Online Access:http://www.biomedcentral.com/1471-2156/11/19
id doaj-d45c9637dec1448584922d250ec14539
record_format Article
spelling doaj-d45c9637dec1448584922d250ec145392020-11-25T03:55:10ZengBMCBMC Genetics1471-21562010-03-011111910.1186/1471-2156-11-19An entropy test for single-locus genetic association analysisGonzález-Pérez AntonioRomo-Astorga AlejandroSusillo-González JuanCordoba JoséMatilla-García MarianoRuiz-Marín ManuelRuiz AgustínGayán Javier<p>Abstract</p> <p>Background</p> <p>The etiology of complex diseases is due to the combination of genetic and environmental factors, usually many of them, and each with a small effect. The identification of these small-effect contributing factors is still a demanding task. Clearly, there is a need for more powerful tests of genetic association, and especially for the identification of rare effects</p> <p>Results</p> <p>We introduce a new genetic association test based on symbolic dynamics and symbolic entropy. Using a freely available software, we have applied this entropy test, and a conventional test, to simulated and real datasets, to illustrate the method and estimate type I error and power. We have also compared this new entropy test to the Fisher exact test for assessment of association with low-frequency SNPs. The entropy test is generally more powerful than the conventional test, and can be significantly more powerful when the genotypic test is applied to low allele-frequency markers. We have also shown that both the Fisher and Entropy methods are optimal to test for association with low-frequency SNPs (MAF around 1-5%), and both are conservative for very rare SNPs (MAF<1%)</p> <p>Conclusions</p> <p>We have developed a new, simple, consistent and powerful test to detect genetic association of biallelic/SNP markers in case-control data, by using symbolic dynamics and symbolic entropy as a measure of gene dependence. We also provide a standard asymptotic distribution of this test statistic. Given that the test is based on entropy measures, it avoids smoothed nonparametric estimation. The entropy test is generally as good or even more powerful than the conventional and Fisher tests. Furthermore, the entropy test is more computationally efficient than the Fisher's Exact test, especially for large number of markers. Therefore, this entropy-based test has the advantage of being optimal for most SNPs, regardless of their allele frequency (Minor Allele Frequency (MAF) between 1-50%). This property is quite beneficial, since many researchers tend to discard low allele-frequency SNPs from their analysis. Now they can apply the same statistical test of association to all SNPs in a single analysis., which can be especially helpful to detect rare effects.</p> http://www.biomedcentral.com/1471-2156/11/19
collection DOAJ
language English
format Article
sources DOAJ
author González-Pérez Antonio
Romo-Astorga Alejandro
Susillo-González Juan
Cordoba José
Matilla-García Mariano
Ruiz-Marín Manuel
Ruiz Agustín
Gayán Javier
spellingShingle González-Pérez Antonio
Romo-Astorga Alejandro
Susillo-González Juan
Cordoba José
Matilla-García Mariano
Ruiz-Marín Manuel
Ruiz Agustín
Gayán Javier
An entropy test for single-locus genetic association analysis
BMC Genetics
author_facet González-Pérez Antonio
Romo-Astorga Alejandro
Susillo-González Juan
Cordoba José
Matilla-García Mariano
Ruiz-Marín Manuel
Ruiz Agustín
Gayán Javier
author_sort González-Pérez Antonio
title An entropy test for single-locus genetic association analysis
title_short An entropy test for single-locus genetic association analysis
title_full An entropy test for single-locus genetic association analysis
title_fullStr An entropy test for single-locus genetic association analysis
title_full_unstemmed An entropy test for single-locus genetic association analysis
title_sort entropy test for single-locus genetic association analysis
publisher BMC
series BMC Genetics
issn 1471-2156
publishDate 2010-03-01
description <p>Abstract</p> <p>Background</p> <p>The etiology of complex diseases is due to the combination of genetic and environmental factors, usually many of them, and each with a small effect. The identification of these small-effect contributing factors is still a demanding task. Clearly, there is a need for more powerful tests of genetic association, and especially for the identification of rare effects</p> <p>Results</p> <p>We introduce a new genetic association test based on symbolic dynamics and symbolic entropy. Using a freely available software, we have applied this entropy test, and a conventional test, to simulated and real datasets, to illustrate the method and estimate type I error and power. We have also compared this new entropy test to the Fisher exact test for assessment of association with low-frequency SNPs. The entropy test is generally more powerful than the conventional test, and can be significantly more powerful when the genotypic test is applied to low allele-frequency markers. We have also shown that both the Fisher and Entropy methods are optimal to test for association with low-frequency SNPs (MAF around 1-5%), and both are conservative for very rare SNPs (MAF<1%)</p> <p>Conclusions</p> <p>We have developed a new, simple, consistent and powerful test to detect genetic association of biallelic/SNP markers in case-control data, by using symbolic dynamics and symbolic entropy as a measure of gene dependence. We also provide a standard asymptotic distribution of this test statistic. Given that the test is based on entropy measures, it avoids smoothed nonparametric estimation. The entropy test is generally as good or even more powerful than the conventional and Fisher tests. Furthermore, the entropy test is more computationally efficient than the Fisher's Exact test, especially for large number of markers. Therefore, this entropy-based test has the advantage of being optimal for most SNPs, regardless of their allele frequency (Minor Allele Frequency (MAF) between 1-50%). This property is quite beneficial, since many researchers tend to discard low allele-frequency SNPs from their analysis. Now they can apply the same statistical test of association to all SNPs in a single analysis., which can be especially helpful to detect rare effects.</p>
url http://www.biomedcentral.com/1471-2156/11/19
work_keys_str_mv AT gonzalezperezantonio anentropytestforsinglelocusgeneticassociationanalysis
AT romoastorgaalejandro anentropytestforsinglelocusgeneticassociationanalysis
AT susillogonzalezjuan anentropytestforsinglelocusgeneticassociationanalysis
AT cordobajose anentropytestforsinglelocusgeneticassociationanalysis
AT matillagarciamariano anentropytestforsinglelocusgeneticassociationanalysis
AT ruizmarinmanuel anentropytestforsinglelocusgeneticassociationanalysis
AT ruizagustin anentropytestforsinglelocusgeneticassociationanalysis
AT gayanjavier anentropytestforsinglelocusgeneticassociationanalysis
AT gonzalezperezantonio entropytestforsinglelocusgeneticassociationanalysis
AT romoastorgaalejandro entropytestforsinglelocusgeneticassociationanalysis
AT susillogonzalezjuan entropytestforsinglelocusgeneticassociationanalysis
AT cordobajose entropytestforsinglelocusgeneticassociationanalysis
AT matillagarciamariano entropytestforsinglelocusgeneticassociationanalysis
AT ruizmarinmanuel entropytestforsinglelocusgeneticassociationanalysis
AT ruizagustin entropytestforsinglelocusgeneticassociationanalysis
AT gayanjavier entropytestforsinglelocusgeneticassociationanalysis
_version_ 1724470317932347392