Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes
<p>Abstract</p> <p>Background</p> <p>Mammalian genome sequence data are being acquired in large quantities and at enormous speeds. We now have a tremendous opportunity to better understand which genes are the most variable or conserved, and what their particular functio...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2011-02-01
|
Series: | Biology Direct |
Online Access: | http://www.biology-direct.com/content/6/1/13 |
id |
doaj-91bcd16b533a442cbf21f6d6528a02b9 |
---|---|
record_format |
Article |
spelling |
doaj-91bcd16b533a442cbf21f6d6528a02b92020-11-24T21:35:56ZengBMCBiology Direct1745-61502011-02-01611310.1186/1745-6150-6-13Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genesWang LeiLiu FeiWang DapengHuang ShiYu Jun<p>Abstract</p> <p>Background</p> <p>Mammalian genome sequence data are being acquired in large quantities and at enormous speeds. We now have a tremendous opportunity to better understand which genes are the most variable or conserved, and what their particular functions and evolutionary dynamics are, through comparative genomics.</p> <p>Results</p> <p>We chose human and eleven other high-coverage mammalian genome data–as well as an avian genome as an outgroup–to analyze orthologous protein-coding genes using nonsynonymous (Ka) and synonymous (Ks) substitution rates. After evaluating eight commonly-used methods of Ka and Ks calculation, we observed that these methods yielded a nearly uniform result when estimating Ka, but not Ks (or Ka/Ks). When sorting genes based on Ka, we noticed that fast-evolving and slow-evolving genes often belonged to different functional classes, with respect to species-specificity and lineage-specificity. In particular, we identified two functional classes of genes in the acquired immune system. Fast-evolving genes coded for signal-transducing proteins, such as receptors, ligands, cytokines, and CDs (cluster of differentiation, mostly surface proteins), whereas the slow-evolving genes were for function-modulating proteins, such as kinases and adaptor proteins. In addition, among slow-evolving genes that had functions related to the central nervous system, neurodegenerative disease-related pathways were enriched significantly in most mammalian species. We also confirmed that gene expression was negatively correlated with evolution rate, i.e. slow-evolving genes were expressed at higher levels than fast-evolving genes. Our results indicated that the functional specializations of the three major mammalian clades were: sensory perception and oncogenesis in primates, reproduction and hormone regulation in large mammals, and immunity and angiotensin in rodents.</p> <p>Conclusion</p> <p>Our study suggests that Ka calculation, which is less biased compared to Ks and Ka/Ks, can be used as a parameter to sort genes by evolution rate and can also provide a way to categorize common protein functions and define their interaction networks, either pair-wise or in defined lineages or subgroups. Evaluating gene evolution based on Ka and Ks calculations can be done with large datasets, such as mammalian genomes.</p> <p>Reviewers</p> <p>This article has been reviewed by Drs. Anamaria Necsulea (nominated by Nicolas Galtier), Subhajyoti De (nominated by Sarah Teichmann) and Claus O. Wilke.</p> http://www.biology-direct.com/content/6/1/13 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Wang Lei Liu Fei Wang Dapeng Huang Shi Yu Jun |
spellingShingle |
Wang Lei Liu Fei Wang Dapeng Huang Shi Yu Jun Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes Biology Direct |
author_facet |
Wang Lei Liu Fei Wang Dapeng Huang Shi Yu Jun |
author_sort |
Wang Lei |
title |
Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes |
title_short |
Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes |
title_full |
Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes |
title_fullStr |
Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes |
title_full_unstemmed |
Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes |
title_sort |
nonsynonymous substitution rate (ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes |
publisher |
BMC |
series |
Biology Direct |
issn |
1745-6150 |
publishDate |
2011-02-01 |
description |
<p>Abstract</p> <p>Background</p> <p>Mammalian genome sequence data are being acquired in large quantities and at enormous speeds. We now have a tremendous opportunity to better understand which genes are the most variable or conserved, and what their particular functions and evolutionary dynamics are, through comparative genomics.</p> <p>Results</p> <p>We chose human and eleven other high-coverage mammalian genome data–as well as an avian genome as an outgroup–to analyze orthologous protein-coding genes using nonsynonymous (Ka) and synonymous (Ks) substitution rates. After evaluating eight commonly-used methods of Ka and Ks calculation, we observed that these methods yielded a nearly uniform result when estimating Ka, but not Ks (or Ka/Ks). When sorting genes based on Ka, we noticed that fast-evolving and slow-evolving genes often belonged to different functional classes, with respect to species-specificity and lineage-specificity. In particular, we identified two functional classes of genes in the acquired immune system. Fast-evolving genes coded for signal-transducing proteins, such as receptors, ligands, cytokines, and CDs (cluster of differentiation, mostly surface proteins), whereas the slow-evolving genes were for function-modulating proteins, such as kinases and adaptor proteins. In addition, among slow-evolving genes that had functions related to the central nervous system, neurodegenerative disease-related pathways were enriched significantly in most mammalian species. We also confirmed that gene expression was negatively correlated with evolution rate, i.e. slow-evolving genes were expressed at higher levels than fast-evolving genes. Our results indicated that the functional specializations of the three major mammalian clades were: sensory perception and oncogenesis in primates, reproduction and hormone regulation in large mammals, and immunity and angiotensin in rodents.</p> <p>Conclusion</p> <p>Our study suggests that Ka calculation, which is less biased compared to Ks and Ka/Ks, can be used as a parameter to sort genes by evolution rate and can also provide a way to categorize common protein functions and define their interaction networks, either pair-wise or in defined lineages or subgroups. Evaluating gene evolution based on Ka and Ks calculations can be done with large datasets, such as mammalian genomes.</p> <p>Reviewers</p> <p>This article has been reviewed by Drs. Anamaria Necsulea (nominated by Nicolas Galtier), Subhajyoti De (nominated by Sarah Teichmann) and Claus O. Wilke.</p> |
url |
http://www.biology-direct.com/content/6/1/13 |
work_keys_str_mv |
AT wanglei nonsynonymoussubstitutionratekaisarelativelyconsistentparameterfordefiningfastevolvingandslowevolvingproteincodinggenes AT liufei nonsynonymoussubstitutionratekaisarelativelyconsistentparameterfordefiningfastevolvingandslowevolvingproteincodinggenes AT wangdapeng nonsynonymoussubstitutionratekaisarelativelyconsistentparameterfordefiningfastevolvingandslowevolvingproteincodinggenes AT huangshi nonsynonymoussubstitutionratekaisarelativelyconsistentparameterfordefiningfastevolvingandslowevolvingproteincodinggenes AT yujun nonsynonymoussubstitutionratekaisarelativelyconsistentparameterfordefiningfastevolvingandslowevolvingproteincodinggenes |
_version_ |
1725943299132358656 |