Detecting clusters of mutations.

Positive selection for protein function can lead to multiple mutations within a small stretch of DNA, i.e., to a cluster of mutations. Recently, Wagner proposed a method to detect such mutation clusters. His method, however, did not take into account that residues with high solvent accessibility are...

Full description

Bibliographic Details
Main Authors: Tong Zhou, Peter J Enyeart, Claus O Wilke
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2008-01-01
Series:PLoS ONE
Online Access:https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/19018282/?tool=EBI
id doaj-30ee153206db49ecbf9e29b851503ea5
record_format Article
spelling doaj-30ee153206db49ecbf9e29b851503ea52021-03-03T19:55:27ZengPublic Library of Science (PLoS)PLoS ONE1932-62032008-01-01311e376510.1371/journal.pone.0003765Detecting clusters of mutations.Tong ZhouPeter J EnyeartClaus O WilkePositive selection for protein function can lead to multiple mutations within a small stretch of DNA, i.e., to a cluster of mutations. Recently, Wagner proposed a method to detect such mutation clusters. His method, however, did not take into account that residues with high solvent accessibility are inherently more variable than residues with low solvent accessibility. Here, we propose a new algorithm to detect clustered evolution. Our algorithm controls for different substitution probabilities at buried and exposed sites in the tertiary protein structure, and uses random permutations to calculate accurate P values for inferred clusters. We apply the algorithm to genomes of bacteria, fly, and mammals, and find several clusters of mutations in functionally important regions of proteins. Surprisingly, clustered evolution is a relatively rare phenomenon. Only between 2% and 10% of the genes we analyze contain a statistically significant mutation cluster. We also find that not controlling for solvent accessibility leads to an excess of clusters in terminal and solvent-exposed regions of proteins. Our algorithm provides a novel method to identify functionally relevant divergence between groups of species. Moreover, it could also be useful to detect artifacts in automatically assembled genomes.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/19018282/?tool=EBI
collection DOAJ
language English
format Article
sources DOAJ
author Tong Zhou
Peter J Enyeart
Claus O Wilke
spellingShingle Tong Zhou
Peter J Enyeart
Claus O Wilke
Detecting clusters of mutations.
PLoS ONE
author_facet Tong Zhou
Peter J Enyeart
Claus O Wilke
author_sort Tong Zhou
title Detecting clusters of mutations.
title_short Detecting clusters of mutations.
title_full Detecting clusters of mutations.
title_fullStr Detecting clusters of mutations.
title_full_unstemmed Detecting clusters of mutations.
title_sort detecting clusters of mutations.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2008-01-01
description Positive selection for protein function can lead to multiple mutations within a small stretch of DNA, i.e., to a cluster of mutations. Recently, Wagner proposed a method to detect such mutation clusters. His method, however, did not take into account that residues with high solvent accessibility are inherently more variable than residues with low solvent accessibility. Here, we propose a new algorithm to detect clustered evolution. Our algorithm controls for different substitution probabilities at buried and exposed sites in the tertiary protein structure, and uses random permutations to calculate accurate P values for inferred clusters. We apply the algorithm to genomes of bacteria, fly, and mammals, and find several clusters of mutations in functionally important regions of proteins. Surprisingly, clustered evolution is a relatively rare phenomenon. Only between 2% and 10% of the genes we analyze contain a statistically significant mutation cluster. We also find that not controlling for solvent accessibility leads to an excess of clusters in terminal and solvent-exposed regions of proteins. Our algorithm provides a novel method to identify functionally relevant divergence between groups of species. Moreover, it could also be useful to detect artifacts in automatically assembled genomes.
url https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/19018282/?tool=EBI
work_keys_str_mv AT tongzhou detectingclustersofmutations
AT peterjenyeart detectingclustersofmutations
AT clausowilke detectingclustersofmutations
_version_ 1714825034623614976