A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination.

Genome-Wide Association Studies (GWAS) in microbial organisms have the potential to vastly improve the way we understand, manage, and treat infectious diseases. Yet, microbial GWAS methods established thus far remain insufficiently able to capitalise on the growing wealth of bacterial and viral gene...

Full description

Bibliographic Details
Main Authors: Caitlin Collins, Xavier Didelot
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-02-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1005958
id doaj-0da358d4cde3451ab7dd97a46e19171a
record_format Article
spelling doaj-0da358d4cde3451ab7dd97a46e19171a2021-04-21T15:43:58ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582018-02-01142e100595810.1371/journal.pcbi.1005958A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination.Caitlin CollinsXavier DidelotGenome-Wide Association Studies (GWAS) in microbial organisms have the potential to vastly improve the way we understand, manage, and treat infectious diseases. Yet, microbial GWAS methods established thus far remain insufficiently able to capitalise on the growing wealth of bacterial and viral genetic sequence data. Facing clonal population structure and homologous recombination, existing GWAS methods struggle to achieve both the precision necessary to reject spurious findings and the power required to detect associations in microbes. In this paper, we introduce a novel phylogenetic approach that has been tailor-made for microbial GWAS, which is applicable to organisms ranging from purely clonal to frequently recombining, and to both binary and continuous phenotypes. Our approach is robust to the confounding effects of both population structure and recombination, while maintaining high statistical power to detect associations. Thorough testing via application to simulated data provides strong support for the power and specificity of our approach and demonstrates the advantages offered over alternative cluster-based and dimension-reduction methods. Two applications to Neisseria meningitidis illustrate the versatility and potential of our method, confirming previously-identified penicillin resistance loci and resulting in the identification of both well-characterised and novel drivers of invasive disease. Our method is implemented as an open-source R package called treeWAS which is freely available at https://github.com/caitiecollins/treeWAS.https://doi.org/10.1371/journal.pcbi.1005958
collection DOAJ
language English
format Article
sources DOAJ
author Caitlin Collins
Xavier Didelot
spellingShingle Caitlin Collins
Xavier Didelot
A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination.
PLoS Computational Biology
author_facet Caitlin Collins
Xavier Didelot
author_sort Caitlin Collins
title A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination.
title_short A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination.
title_full A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination.
title_fullStr A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination.
title_full_unstemmed A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination.
title_sort phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2018-02-01
description Genome-Wide Association Studies (GWAS) in microbial organisms have the potential to vastly improve the way we understand, manage, and treat infectious diseases. Yet, microbial GWAS methods established thus far remain insufficiently able to capitalise on the growing wealth of bacterial and viral genetic sequence data. Facing clonal population structure and homologous recombination, existing GWAS methods struggle to achieve both the precision necessary to reject spurious findings and the power required to detect associations in microbes. In this paper, we introduce a novel phylogenetic approach that has been tailor-made for microbial GWAS, which is applicable to organisms ranging from purely clonal to frequently recombining, and to both binary and continuous phenotypes. Our approach is robust to the confounding effects of both population structure and recombination, while maintaining high statistical power to detect associations. Thorough testing via application to simulated data provides strong support for the power and specificity of our approach and demonstrates the advantages offered over alternative cluster-based and dimension-reduction methods. Two applications to Neisseria meningitidis illustrate the versatility and potential of our method, confirming previously-identified penicillin resistance loci and resulting in the identification of both well-characterised and novel drivers of invasive disease. Our method is implemented as an open-source R package called treeWAS which is freely available at https://github.com/caitiecollins/treeWAS.
url https://doi.org/10.1371/journal.pcbi.1005958
work_keys_str_mv AT caitlincollins aphylogeneticmethodtoperformgenomewideassociationstudiesinmicrobesthataccountsforpopulationstructureandrecombination
AT xavierdidelot aphylogeneticmethodtoperformgenomewideassociationstudiesinmicrobesthataccountsforpopulationstructureandrecombination
AT caitlincollins phylogeneticmethodtoperformgenomewideassociationstudiesinmicrobesthataccountsforpopulationstructureandrecombination
AT xavierdidelot phylogeneticmethodtoperformgenomewideassociationstudiesinmicrobesthataccountsforpopulationstructureandrecombination
_version_ 1714666965521399808