The Impact of Equilibrium Assumptions on Tests of Selection

With the increasing availability and quality of whole genome population data, various methodologies of population genetic inference are being utilized in order to identify and quantify recent population-level selective events. Though there has been a great proliferation of such methodology, the type...

Full description

Bibliographic Details
Main Authors: Jessica L. Crisci, Yu-Ping ePoh, Shivani eMahajan, Jeffrey D. Jensen
Format: Article
Language:English
Published: Frontiers Media S.A. 2013-11-01
Series:Frontiers in Genetics
Subjects:
Online Access:http://journal.frontiersin.org/Journal/10.3389/fgene.2013.00235/full
id doaj-8d27a277292446c99dfeb119e09efa48
record_format Article
spelling doaj-8d27a277292446c99dfeb119e09efa482020-11-24T22:54:14ZengFrontiers Media S.A.Frontiers in Genetics1664-80212013-11-01410.3389/fgene.2013.0023564823The Impact of Equilibrium Assumptions on Tests of SelectionJessica L. Crisci0Jessica L. Crisci1Yu-Ping ePoh2Yu-Ping ePoh3Shivani eMahajan4Shivani eMahajan5Jeffrey D. Jensen6Jeffrey D. Jensen7Swiss Institute of Bioinformatics (SIB)University of Massachusetts Medical SchoolUniversity of Massachusetts Medical SchoolSwiss Institute of Bioinformatics (SIB)École Polytechnique Fédérale de Lausanne (EPFL)Swiss Institute of Bioinformatics (SIB)École Polytechnique Fédérale de Lausanne (EPFL)Swiss Institute of Bioinformatics (SIB)With the increasing availability and quality of whole genome population data, various methodologies of population genetic inference are being utilized in order to identify and quantify recent population-level selective events. Though there has been a great proliferation of such methodology, the type-I and type-II error rates of many proposed statistics have not been well-described. Moreover, the performance of these statistics is often not evaluated for different biologically relevant scenarios (e.g., population size change, population structure), nor for the effect of differing data sizes (i.e., genomic vs. sub-genomic). The absence of the above information makes it difficult to evaluate newly available statistics relative to one another, and thus difficult to choose the proper toolset for a given empirical analysis. Thus, we here describe and compare the performance of four widely used tests of selection: SweepFinder, SweeD, OmegaPlus, and iHS. In order to consider the above questions, we utilize simulated data spanning a variety of selection coefficients and beneficial mutation rates. We demonstrate that the LD-based OmegaPlus performs best in terms of power to reject the neutral model under both equilibrium and non-equilibrium conditions – an important result regarding the relative effectiveness of linkage disequilibrium relative to site frequency spectrum based statics. The results presented here ought to serve as a useful guide for future empirical studies, and provides a guide for statistical choice depending on the history of the population under consideration. Moreover, the parameter space investigated and the Type-I and Type-II error rates calculated, represent a natural benchmark by which future statistics may be assessed.http://journal.frontiersin.org/Journal/10.3389/fgene.2013.00235/fullDemographysimulationPopulation Geneticspositive selectionstatistical inference
collection DOAJ
language English
format Article
sources DOAJ
author Jessica L. Crisci
Jessica L. Crisci
Yu-Ping ePoh
Yu-Ping ePoh
Shivani eMahajan
Shivani eMahajan
Jeffrey D. Jensen
Jeffrey D. Jensen
spellingShingle Jessica L. Crisci
Jessica L. Crisci
Yu-Ping ePoh
Yu-Ping ePoh
Shivani eMahajan
Shivani eMahajan
Jeffrey D. Jensen
Jeffrey D. Jensen
The Impact of Equilibrium Assumptions on Tests of Selection
Frontiers in Genetics
Demography
simulation
Population Genetics
positive selection
statistical inference
author_facet Jessica L. Crisci
Jessica L. Crisci
Yu-Ping ePoh
Yu-Ping ePoh
Shivani eMahajan
Shivani eMahajan
Jeffrey D. Jensen
Jeffrey D. Jensen
author_sort Jessica L. Crisci
title The Impact of Equilibrium Assumptions on Tests of Selection
title_short The Impact of Equilibrium Assumptions on Tests of Selection
title_full The Impact of Equilibrium Assumptions on Tests of Selection
title_fullStr The Impact of Equilibrium Assumptions on Tests of Selection
title_full_unstemmed The Impact of Equilibrium Assumptions on Tests of Selection
title_sort impact of equilibrium assumptions on tests of selection
publisher Frontiers Media S.A.
series Frontiers in Genetics
issn 1664-8021
publishDate 2013-11-01
description With the increasing availability and quality of whole genome population data, various methodologies of population genetic inference are being utilized in order to identify and quantify recent population-level selective events. Though there has been a great proliferation of such methodology, the type-I and type-II error rates of many proposed statistics have not been well-described. Moreover, the performance of these statistics is often not evaluated for different biologically relevant scenarios (e.g., population size change, population structure), nor for the effect of differing data sizes (i.e., genomic vs. sub-genomic). The absence of the above information makes it difficult to evaluate newly available statistics relative to one another, and thus difficult to choose the proper toolset for a given empirical analysis. Thus, we here describe and compare the performance of four widely used tests of selection: SweepFinder, SweeD, OmegaPlus, and iHS. In order to consider the above questions, we utilize simulated data spanning a variety of selection coefficients and beneficial mutation rates. We demonstrate that the LD-based OmegaPlus performs best in terms of power to reject the neutral model under both equilibrium and non-equilibrium conditions – an important result regarding the relative effectiveness of linkage disequilibrium relative to site frequency spectrum based statics. The results presented here ought to serve as a useful guide for future empirical studies, and provides a guide for statistical choice depending on the history of the population under consideration. Moreover, the parameter space investigated and the Type-I and Type-II error rates calculated, represent a natural benchmark by which future statistics may be assessed.
topic Demography
simulation
Population Genetics
positive selection
statistical inference
url http://journal.frontiersin.org/Journal/10.3389/fgene.2013.00235/full
work_keys_str_mv AT jessicalcrisci theimpactofequilibriumassumptionsontestsofselection
AT jessicalcrisci theimpactofequilibriumassumptionsontestsofselection
AT yupingepoh theimpactofequilibriumassumptionsontestsofselection
AT yupingepoh theimpactofequilibriumassumptionsontestsofselection
AT shivaniemahajan theimpactofequilibriumassumptionsontestsofselection
AT shivaniemahajan theimpactofequilibriumassumptionsontestsofselection
AT jeffreydjensen theimpactofequilibriumassumptionsontestsofselection
AT jeffreydjensen theimpactofequilibriumassumptionsontestsofselection
AT jessicalcrisci impactofequilibriumassumptionsontestsofselection
AT jessicalcrisci impactofequilibriumassumptionsontestsofselection
AT yupingepoh impactofequilibriumassumptionsontestsofselection
AT yupingepoh impactofequilibriumassumptionsontestsofselection
AT shivaniemahajan impactofequilibriumassumptionsontestsofselection
AT shivaniemahajan impactofequilibriumassumptionsontestsofselection
AT jeffreydjensen impactofequilibriumassumptionsontestsofselection
AT jeffreydjensen impactofequilibriumassumptionsontestsofselection
_version_ 1725661361013260288