A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination.
Genome-Wide Association Studies (GWAS) in microbial organisms have the potential to vastly improve the way we understand, manage, and treat infectious diseases. Yet, microbial GWAS methods established thus far remain insufficiently able to capitalise on the growing wealth of bacterial and viral gene...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2018-02-01
|
Series: | PLoS Computational Biology |
Online Access: | https://doi.org/10.1371/journal.pcbi.1005958 |
id |
doaj-0da358d4cde3451ab7dd97a46e19171a |
---|---|
record_format |
Article |
spelling |
doaj-0da358d4cde3451ab7dd97a46e19171a2021-04-21T15:43:58ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582018-02-01142e100595810.1371/journal.pcbi.1005958A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination.Caitlin CollinsXavier DidelotGenome-Wide Association Studies (GWAS) in microbial organisms have the potential to vastly improve the way we understand, manage, and treat infectious diseases. Yet, microbial GWAS methods established thus far remain insufficiently able to capitalise on the growing wealth of bacterial and viral genetic sequence data. Facing clonal population structure and homologous recombination, existing GWAS methods struggle to achieve both the precision necessary to reject spurious findings and the power required to detect associations in microbes. In this paper, we introduce a novel phylogenetic approach that has been tailor-made for microbial GWAS, which is applicable to organisms ranging from purely clonal to frequently recombining, and to both binary and continuous phenotypes. Our approach is robust to the confounding effects of both population structure and recombination, while maintaining high statistical power to detect associations. Thorough testing via application to simulated data provides strong support for the power and specificity of our approach and demonstrates the advantages offered over alternative cluster-based and dimension-reduction methods. Two applications to Neisseria meningitidis illustrate the versatility and potential of our method, confirming previously-identified penicillin resistance loci and resulting in the identification of both well-characterised and novel drivers of invasive disease. Our method is implemented as an open-source R package called treeWAS which is freely available at https://github.com/caitiecollins/treeWAS.https://doi.org/10.1371/journal.pcbi.1005958 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Caitlin Collins Xavier Didelot |
spellingShingle |
Caitlin Collins Xavier Didelot A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination. PLoS Computational Biology |
author_facet |
Caitlin Collins Xavier Didelot |
author_sort |
Caitlin Collins |
title |
A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination. |
title_short |
A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination. |
title_full |
A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination. |
title_fullStr |
A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination. |
title_full_unstemmed |
A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination. |
title_sort |
phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS Computational Biology |
issn |
1553-734X 1553-7358 |
publishDate |
2018-02-01 |
description |
Genome-Wide Association Studies (GWAS) in microbial organisms have the potential to vastly improve the way we understand, manage, and treat infectious diseases. Yet, microbial GWAS methods established thus far remain insufficiently able to capitalise on the growing wealth of bacterial and viral genetic sequence data. Facing clonal population structure and homologous recombination, existing GWAS methods struggle to achieve both the precision necessary to reject spurious findings and the power required to detect associations in microbes. In this paper, we introduce a novel phylogenetic approach that has been tailor-made for microbial GWAS, which is applicable to organisms ranging from purely clonal to frequently recombining, and to both binary and continuous phenotypes. Our approach is robust to the confounding effects of both population structure and recombination, while maintaining high statistical power to detect associations. Thorough testing via application to simulated data provides strong support for the power and specificity of our approach and demonstrates the advantages offered over alternative cluster-based and dimension-reduction methods. Two applications to Neisseria meningitidis illustrate the versatility and potential of our method, confirming previously-identified penicillin resistance loci and resulting in the identification of both well-characterised and novel drivers of invasive disease. Our method is implemented as an open-source R package called treeWAS which is freely available at https://github.com/caitiecollins/treeWAS. |
url |
https://doi.org/10.1371/journal.pcbi.1005958 |
work_keys_str_mv |
AT caitlincollins aphylogeneticmethodtoperformgenomewideassociationstudiesinmicrobesthataccountsforpopulationstructureandrecombination AT xavierdidelot aphylogeneticmethodtoperformgenomewideassociationstudiesinmicrobesthataccountsforpopulationstructureandrecombination AT caitlincollins phylogeneticmethodtoperformgenomewideassociationstudiesinmicrobesthataccountsforpopulationstructureandrecombination AT xavierdidelot phylogeneticmethodtoperformgenomewideassociationstudiesinmicrobesthataccountsforpopulationstructureandrecombination |
_version_ |
1714666965521399808 |