Tests of Selection in Pooled Case-Control Data: An Empirical Study

For smaller organisms with faster breeding cycles, artificial selection can be used to create sub-populations with different phenotypic traits. Genetic tests can be employed to identify the causal markers for the phenotypes, as a precursor to engineering strains with a combination of traits. Traditi...

Full description

Bibliographic Details
Main Authors: Nitin eUdpa, Dan eZhou, Gabriel G Haddad, Vineet eBafna
Format: Article
Language:English
Published: Frontiers Media S.A. 2011-11-01
Series:Frontiers in Genetics
Subjects:
Online Access:http://journal.frontiersin.org/Journal/10.3389/fgene.2011.00083/full
id doaj-300f6c750e5846d588148c31dac9ba7d
record_format Article
spelling doaj-300f6c750e5846d588148c31dac9ba7d2020-11-24T22:54:21ZengFrontiers Media S.A.Frontiers in Genetics1664-80212011-11-01210.3389/fgene.2011.0008313406Tests of Selection in Pooled Case-Control Data: An Empirical StudyNitin eUdpa0Dan eZhou1Gabriel G Haddad2Gabriel G Haddad3Vineet eBafna4Vineet eBafna5UC San DiegoUC San DiegoUC San DiegoRady Children's HospitalUC San DiegoUC San DiegoFor smaller organisms with faster breeding cycles, artificial selection can be used to create sub-populations with different phenotypic traits. Genetic tests can be employed to identify the causal markers for the phenotypes, as a precursor to engineering strains with a combination of traits. Traditional approaches involve analyzing crosses of inbred strains to test for co-segregation with genetic markers. Here we take advantage of cheaper next generation sequencing techniques to identifygenetic signatures of adaptation to the selection constraints. Obtaining individual sequencing data is often unrealistic due to cost and sample issues, so we focus on pooled genomic data.In this paper, we explore a series of statistical tests for selection using pooled case (under selection) and control populations. Extensive simulations are used to show that these approaches work well for a wide range of population divergence times and strong selective pressures. We show that pooling does not have a significant impact on statistical power. The tests are also robust to reasonable variations in several different parameters, including window size, base-calling error rate, and sequencing coverage. We then demonstrate the viability (and the challenges) of one of these methods in two independent Drosophila populations (Drosophila melanogaster) bred under selectionfor hypoxia and accelerated development, respectively. Testing for extreme hypoxia tolerance showed clear signals of selection, pointing to loci that are important for hypoxia adaptation.Overall, we outline a strategy for finding regions under selection using pooled sequences, then devise optimal tests for that strategy. The approaches show promise for detecting selection, even several generations after fixation of the beneficial allele has occurred.http://journal.frontiersin.org/Journal/10.3389/fgene.2011.00083/fullCase-Control FrameworkSequence PoolingTests of Selection
collection DOAJ
language English
format Article
sources DOAJ
author Nitin eUdpa
Dan eZhou
Gabriel G Haddad
Gabriel G Haddad
Vineet eBafna
Vineet eBafna
spellingShingle Nitin eUdpa
Dan eZhou
Gabriel G Haddad
Gabriel G Haddad
Vineet eBafna
Vineet eBafna
Tests of Selection in Pooled Case-Control Data: An Empirical Study
Frontiers in Genetics
Case-Control Framework
Sequence Pooling
Tests of Selection
author_facet Nitin eUdpa
Dan eZhou
Gabriel G Haddad
Gabriel G Haddad
Vineet eBafna
Vineet eBafna
author_sort Nitin eUdpa
title Tests of Selection in Pooled Case-Control Data: An Empirical Study
title_short Tests of Selection in Pooled Case-Control Data: An Empirical Study
title_full Tests of Selection in Pooled Case-Control Data: An Empirical Study
title_fullStr Tests of Selection in Pooled Case-Control Data: An Empirical Study
title_full_unstemmed Tests of Selection in Pooled Case-Control Data: An Empirical Study
title_sort tests of selection in pooled case-control data: an empirical study
publisher Frontiers Media S.A.
series Frontiers in Genetics
issn 1664-8021
publishDate 2011-11-01
description For smaller organisms with faster breeding cycles, artificial selection can be used to create sub-populations with different phenotypic traits. Genetic tests can be employed to identify the causal markers for the phenotypes, as a precursor to engineering strains with a combination of traits. Traditional approaches involve analyzing crosses of inbred strains to test for co-segregation with genetic markers. Here we take advantage of cheaper next generation sequencing techniques to identifygenetic signatures of adaptation to the selection constraints. Obtaining individual sequencing data is often unrealistic due to cost and sample issues, so we focus on pooled genomic data.In this paper, we explore a series of statistical tests for selection using pooled case (under selection) and control populations. Extensive simulations are used to show that these approaches work well for a wide range of population divergence times and strong selective pressures. We show that pooling does not have a significant impact on statistical power. The tests are also robust to reasonable variations in several different parameters, including window size, base-calling error rate, and sequencing coverage. We then demonstrate the viability (and the challenges) of one of these methods in two independent Drosophila populations (Drosophila melanogaster) bred under selectionfor hypoxia and accelerated development, respectively. Testing for extreme hypoxia tolerance showed clear signals of selection, pointing to loci that are important for hypoxia adaptation.Overall, we outline a strategy for finding regions under selection using pooled sequences, then devise optimal tests for that strategy. The approaches show promise for detecting selection, even several generations after fixation of the beneficial allele has occurred.
topic Case-Control Framework
Sequence Pooling
Tests of Selection
url http://journal.frontiersin.org/Journal/10.3389/fgene.2011.00083/full
work_keys_str_mv AT nitineudpa testsofselectioninpooledcasecontroldataanempiricalstudy
AT danezhou testsofselectioninpooledcasecontroldataanempiricalstudy
AT gabrielghaddad testsofselectioninpooledcasecontroldataanempiricalstudy
AT gabrielghaddad testsofselectioninpooledcasecontroldataanempiricalstudy
AT vineetebafna testsofselectioninpooledcasecontroldataanempiricalstudy
AT vineetebafna testsofselectioninpooledcasecontroldataanempiricalstudy
_version_ 1725660442761625600