Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes

<p>Abstract</p> <p>Background</p> <p>Mammalian genome sequence data are being acquired in large quantities and at enormous speeds. We now have a tremendous opportunity to better understand which genes are the most variable or conserved, and what their particular functio...

Full description

Bibliographic Details
Main Authors: Wang Lei, Liu Fei, Wang Dapeng, Huang Shi, Yu Jun
Format: Article
Language:English
Published: BMC 2011-02-01
Series:Biology Direct
Online Access:http://www.biology-direct.com/content/6/1/13
id doaj-91bcd16b533a442cbf21f6d6528a02b9
record_format Article
spelling doaj-91bcd16b533a442cbf21f6d6528a02b92020-11-24T21:35:56ZengBMCBiology Direct1745-61502011-02-01611310.1186/1745-6150-6-13Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genesWang LeiLiu FeiWang DapengHuang ShiYu Jun<p>Abstract</p> <p>Background</p> <p>Mammalian genome sequence data are being acquired in large quantities and at enormous speeds. We now have a tremendous opportunity to better understand which genes are the most variable or conserved, and what their particular functions and evolutionary dynamics are, through comparative genomics.</p> <p>Results</p> <p>We chose human and eleven other high-coverage mammalian genome data–as well as an avian genome as an outgroup–to analyze orthologous protein-coding genes using nonsynonymous (Ka) and synonymous (Ks) substitution rates. After evaluating eight commonly-used methods of Ka and Ks calculation, we observed that these methods yielded a nearly uniform result when estimating Ka, but not Ks (or Ka/Ks). When sorting genes based on Ka, we noticed that fast-evolving and slow-evolving genes often belonged to different functional classes, with respect to species-specificity and lineage-specificity. In particular, we identified two functional classes of genes in the acquired immune system. Fast-evolving genes coded for signal-transducing proteins, such as receptors, ligands, cytokines, and CDs (cluster of differentiation, mostly surface proteins), whereas the slow-evolving genes were for function-modulating proteins, such as kinases and adaptor proteins. In addition, among slow-evolving genes that had functions related to the central nervous system, neurodegenerative disease-related pathways were enriched significantly in most mammalian species. We also confirmed that gene expression was negatively correlated with evolution rate, i.e. slow-evolving genes were expressed at higher levels than fast-evolving genes. Our results indicated that the functional specializations of the three major mammalian clades were: sensory perception and oncogenesis in primates, reproduction and hormone regulation in large mammals, and immunity and angiotensin in rodents.</p> <p>Conclusion</p> <p>Our study suggests that Ka calculation, which is less biased compared to Ks and Ka/Ks, can be used as a parameter to sort genes by evolution rate and can also provide a way to categorize common protein functions and define their interaction networks, either pair-wise or in defined lineages or subgroups. Evaluating gene evolution based on Ka and Ks calculations can be done with large datasets, such as mammalian genomes.</p> <p>Reviewers</p> <p>This article has been reviewed by Drs. Anamaria Necsulea (nominated by Nicolas Galtier), Subhajyoti De (nominated by Sarah Teichmann) and Claus O. Wilke.</p> http://www.biology-direct.com/content/6/1/13
collection DOAJ
language English
format Article
sources DOAJ
author Wang Lei
Liu Fei
Wang Dapeng
Huang Shi
Yu Jun
spellingShingle Wang Lei
Liu Fei
Wang Dapeng
Huang Shi
Yu Jun
Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes
Biology Direct
author_facet Wang Lei
Liu Fei
Wang Dapeng
Huang Shi
Yu Jun
author_sort Wang Lei
title Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes
title_short Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes
title_full Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes
title_fullStr Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes
title_full_unstemmed Nonsynonymous substitution rate (Ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes
title_sort nonsynonymous substitution rate (ka) is a relatively consistent parameter for defining fast-evolving and slow-evolving protein-coding genes
publisher BMC
series Biology Direct
issn 1745-6150
publishDate 2011-02-01
description <p>Abstract</p> <p>Background</p> <p>Mammalian genome sequence data are being acquired in large quantities and at enormous speeds. We now have a tremendous opportunity to better understand which genes are the most variable or conserved, and what their particular functions and evolutionary dynamics are, through comparative genomics.</p> <p>Results</p> <p>We chose human and eleven other high-coverage mammalian genome data–as well as an avian genome as an outgroup–to analyze orthologous protein-coding genes using nonsynonymous (Ka) and synonymous (Ks) substitution rates. After evaluating eight commonly-used methods of Ka and Ks calculation, we observed that these methods yielded a nearly uniform result when estimating Ka, but not Ks (or Ka/Ks). When sorting genes based on Ka, we noticed that fast-evolving and slow-evolving genes often belonged to different functional classes, with respect to species-specificity and lineage-specificity. In particular, we identified two functional classes of genes in the acquired immune system. Fast-evolving genes coded for signal-transducing proteins, such as receptors, ligands, cytokines, and CDs (cluster of differentiation, mostly surface proteins), whereas the slow-evolving genes were for function-modulating proteins, such as kinases and adaptor proteins. In addition, among slow-evolving genes that had functions related to the central nervous system, neurodegenerative disease-related pathways were enriched significantly in most mammalian species. We also confirmed that gene expression was negatively correlated with evolution rate, i.e. slow-evolving genes were expressed at higher levels than fast-evolving genes. Our results indicated that the functional specializations of the three major mammalian clades were: sensory perception and oncogenesis in primates, reproduction and hormone regulation in large mammals, and immunity and angiotensin in rodents.</p> <p>Conclusion</p> <p>Our study suggests that Ka calculation, which is less biased compared to Ks and Ka/Ks, can be used as a parameter to sort genes by evolution rate and can also provide a way to categorize common protein functions and define their interaction networks, either pair-wise or in defined lineages or subgroups. Evaluating gene evolution based on Ka and Ks calculations can be done with large datasets, such as mammalian genomes.</p> <p>Reviewers</p> <p>This article has been reviewed by Drs. Anamaria Necsulea (nominated by Nicolas Galtier), Subhajyoti De (nominated by Sarah Teichmann) and Claus O. Wilke.</p>
url http://www.biology-direct.com/content/6/1/13
work_keys_str_mv AT wanglei nonsynonymoussubstitutionratekaisarelativelyconsistentparameterfordefiningfastevolvingandslowevolvingproteincodinggenes
AT liufei nonsynonymoussubstitutionratekaisarelativelyconsistentparameterfordefiningfastevolvingandslowevolvingproteincodinggenes
AT wangdapeng nonsynonymoussubstitutionratekaisarelativelyconsistentparameterfordefiningfastevolvingandslowevolvingproteincodinggenes
AT huangshi nonsynonymoussubstitutionratekaisarelativelyconsistentparameterfordefiningfastevolvingandslowevolvingproteincodinggenes
AT yujun nonsynonymoussubstitutionratekaisarelativelyconsistentparameterfordefiningfastevolvingandslowevolvingproteincodinggenes
_version_ 1725943299132358656