A novel evolutionary model for constructing gene coexpression networks with comprehensive features

Abstract Background Uncovering the evolutionary principles of gene coexpression network is important for our understanding of the network topological property of new genes. However, most existing evolutionary models only considered the evolution of duplication genes and only based on the degree of g...

Full description

Bibliographic Details
Main Authors: Yuexi Gu, Jian Zu, Yu Li
Format: Article
Language:English
Published: BMC 2019-09-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-019-3035-7
id doaj-5ada57fb70924bd6b81c8e2795a19cda
record_format Article
spelling doaj-5ada57fb70924bd6b81c8e2795a19cda2020-11-25T03:46:46ZengBMCBMC Bioinformatics1471-21052019-09-0120112010.1186/s12859-019-3035-7A novel evolutionary model for constructing gene coexpression networks with comprehensive featuresYuexi Gu0Jian Zu1Yu Li2School of Mathematics and Statistics, Xi’an Jiaotong UniversitySchool of Mathematics and Statistics, Xi’an Jiaotong UniversitySchool of Mathematics and Statistics, Xi’an Jiaotong UniversityAbstract Background Uncovering the evolutionary principles of gene coexpression network is important for our understanding of the network topological property of new genes. However, most existing evolutionary models only considered the evolution of duplication genes and only based on the degree of genes, ignoring the other key topological properties. The evolutionary mechanism by which how are new genes integrated into the ancestral networks are not yet to be comprehensively characterized. Herein, based on the human ribonucleic acid-sequencing (RNA-seq) data, we develop a new evolutionary model of gene coexpression network which considers the evolutionary process of both duplication genes and de novo genes. Results Based on the human RNA-seq data, we construct a gene coexpression network consisting of 8061 genes and 638624 links. We find that there are 1394 duplication genes and 126 de novo genes in the network. Then based on human gene age data, we reproduce the evolutionary process of this gene coexpression network and develop a new evolutionary model. We find that the generation rates of duplication genes and de novo genes are approximately 3.58/Myr (Myr=Million year) and 0.31/Myr, respectively. Based on the average degree and coreness of parent genes, we find that the gene duplication is a random process. Eventually duplication genes only inherit 12.89% connections from their parent genes and the retained connections have a smaller edge betweenness. Moreover, we find that both duplication genes and de novo genes prefer to develop new interactions with genes which have a large degree and a large coreness. Our proposed model can generate an evolutionary network when the number of newly added genes or the length of evolutionary time is known. Conclusions Gene duplication and de novo genes are two dominant evolutionary forces in shaping the coexpression network. Both duplication genes and de novo genes develop new interactions through a “rich-gets-richer" mechanism in terms of degree and coreness. This mechanism leads to the scale-free property and hierarchical architecture of biomolecular network. The proposed model is able to construct a gene coexpression network with comprehensive biological characteristics.http://link.springer.com/article/10.1186/s12859-019-3035-7GenomeNetwork evolutionNetwork biologyDuplication genesDe novo genes
collection DOAJ
language English
format Article
sources DOAJ
author Yuexi Gu
Jian Zu
Yu Li
spellingShingle Yuexi Gu
Jian Zu
Yu Li
A novel evolutionary model for constructing gene coexpression networks with comprehensive features
BMC Bioinformatics
Genome
Network evolution
Network biology
Duplication genes
De novo genes
author_facet Yuexi Gu
Jian Zu
Yu Li
author_sort Yuexi Gu
title A novel evolutionary model for constructing gene coexpression networks with comprehensive features
title_short A novel evolutionary model for constructing gene coexpression networks with comprehensive features
title_full A novel evolutionary model for constructing gene coexpression networks with comprehensive features
title_fullStr A novel evolutionary model for constructing gene coexpression networks with comprehensive features
title_full_unstemmed A novel evolutionary model for constructing gene coexpression networks with comprehensive features
title_sort novel evolutionary model for constructing gene coexpression networks with comprehensive features
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2019-09-01
description Abstract Background Uncovering the evolutionary principles of gene coexpression network is important for our understanding of the network topological property of new genes. However, most existing evolutionary models only considered the evolution of duplication genes and only based on the degree of genes, ignoring the other key topological properties. The evolutionary mechanism by which how are new genes integrated into the ancestral networks are not yet to be comprehensively characterized. Herein, based on the human ribonucleic acid-sequencing (RNA-seq) data, we develop a new evolutionary model of gene coexpression network which considers the evolutionary process of both duplication genes and de novo genes. Results Based on the human RNA-seq data, we construct a gene coexpression network consisting of 8061 genes and 638624 links. We find that there are 1394 duplication genes and 126 de novo genes in the network. Then based on human gene age data, we reproduce the evolutionary process of this gene coexpression network and develop a new evolutionary model. We find that the generation rates of duplication genes and de novo genes are approximately 3.58/Myr (Myr=Million year) and 0.31/Myr, respectively. Based on the average degree and coreness of parent genes, we find that the gene duplication is a random process. Eventually duplication genes only inherit 12.89% connections from their parent genes and the retained connections have a smaller edge betweenness. Moreover, we find that both duplication genes and de novo genes prefer to develop new interactions with genes which have a large degree and a large coreness. Our proposed model can generate an evolutionary network when the number of newly added genes or the length of evolutionary time is known. Conclusions Gene duplication and de novo genes are two dominant evolutionary forces in shaping the coexpression network. Both duplication genes and de novo genes develop new interactions through a “rich-gets-richer" mechanism in terms of degree and coreness. This mechanism leads to the scale-free property and hierarchical architecture of biomolecular network. The proposed model is able to construct a gene coexpression network with comprehensive biological characteristics.
topic Genome
Network evolution
Network biology
Duplication genes
De novo genes
url http://link.springer.com/article/10.1186/s12859-019-3035-7
work_keys_str_mv AT yuexigu anovelevolutionarymodelforconstructinggenecoexpressionnetworkswithcomprehensivefeatures
AT jianzu anovelevolutionarymodelforconstructinggenecoexpressionnetworkswithcomprehensivefeatures
AT yuli anovelevolutionarymodelforconstructinggenecoexpressionnetworkswithcomprehensivefeatures
AT yuexigu novelevolutionarymodelforconstructinggenecoexpressionnetworkswithcomprehensivefeatures
AT jianzu novelevolutionarymodelforconstructinggenecoexpressionnetworkswithcomprehensivefeatures
AT yuli novelevolutionarymodelforconstructinggenecoexpressionnetworkswithcomprehensivefeatures
_version_ 1724504286421843968