Genome-wide analysis of codon usage bias in four sequenced cotton species.

Codon usage bias (CUB) is an important evolutionary feature in a genome which provides important information for studying organism evolution, gene function and exogenous gene expression. The CUB and its shaping factors in the nuclear genomes of four sequenced cotton species, G. arboreum (A2), G. rai...

Full description

Bibliographic Details
Main Authors: Liyuan Wang, Huixian Xing, Yanchao Yuan, Xianlin Wang, Muhammad Saeed, Jincai Tao, Wei Feng, Guihua Zhang, Xianliang Song, Xuezhen Sun
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5870960?pdf=render
id doaj-b2c06bbfe5674b67a4ec1e56641b4f6a
record_format Article
spelling doaj-b2c06bbfe5674b67a4ec1e56641b4f6a2020-11-24T21:47:47ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-01133e019437210.1371/journal.pone.0194372Genome-wide analysis of codon usage bias in four sequenced cotton species.Liyuan WangHuixian XingYanchao YuanXianlin WangMuhammad SaeedJincai TaoWei FengGuihua ZhangXianliang SongXuezhen SunCodon usage bias (CUB) is an important evolutionary feature in a genome which provides important information for studying organism evolution, gene function and exogenous gene expression. The CUB and its shaping factors in the nuclear genomes of four sequenced cotton species, G. arboreum (A2), G. raimondii (D5), G. hirsutum (AD1) and G. barbadense (AD2) were analyzed in the present study. The effective number of codons (ENC) analysis showed the CUB was weak in these four species and the four subgenomes of the two tetraploids. Codon composition analysis revealed these four species preferred to use pyrimidine-rich codons more frequently than purine-rich codons. Correlation analysis indicated that the base content at the third position of codons affect the degree of codon preference. PR2-bias plot and ENC-plot analyses revealed that the CUB patterns in these genomes and subgenomes were influenced by combined effects of translational selection, directional mutation and other factors. The translational selection (P2) analysis results, together with the non-significant correlation between GC12 and GC3, further revealed that translational selection played the dominant role over mutation pressure in the codon usage bias. Through relative synonymous codon usage (RSCU) analysis, we detected 25 high frequency codons preferred to end with T or A, and 31 low frequency codons inclined to end with C or G in these four species and four subgenomes. Finally, 19 to 26 optimal codons with 19 common ones were determined for each species and subgenomes, which preferred to end with A or T. We concluded that the codon usage bias was weak and the translation selection was the main shaping factor in nuclear genes of these four cotton genomes and four subgenomes.http://europepmc.org/articles/PMC5870960?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Liyuan Wang
Huixian Xing
Yanchao Yuan
Xianlin Wang
Muhammad Saeed
Jincai Tao
Wei Feng
Guihua Zhang
Xianliang Song
Xuezhen Sun
spellingShingle Liyuan Wang
Huixian Xing
Yanchao Yuan
Xianlin Wang
Muhammad Saeed
Jincai Tao
Wei Feng
Guihua Zhang
Xianliang Song
Xuezhen Sun
Genome-wide analysis of codon usage bias in four sequenced cotton species.
PLoS ONE
author_facet Liyuan Wang
Huixian Xing
Yanchao Yuan
Xianlin Wang
Muhammad Saeed
Jincai Tao
Wei Feng
Guihua Zhang
Xianliang Song
Xuezhen Sun
author_sort Liyuan Wang
title Genome-wide analysis of codon usage bias in four sequenced cotton species.
title_short Genome-wide analysis of codon usage bias in four sequenced cotton species.
title_full Genome-wide analysis of codon usage bias in four sequenced cotton species.
title_fullStr Genome-wide analysis of codon usage bias in four sequenced cotton species.
title_full_unstemmed Genome-wide analysis of codon usage bias in four sequenced cotton species.
title_sort genome-wide analysis of codon usage bias in four sequenced cotton species.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2018-01-01
description Codon usage bias (CUB) is an important evolutionary feature in a genome which provides important information for studying organism evolution, gene function and exogenous gene expression. The CUB and its shaping factors in the nuclear genomes of four sequenced cotton species, G. arboreum (A2), G. raimondii (D5), G. hirsutum (AD1) and G. barbadense (AD2) were analyzed in the present study. The effective number of codons (ENC) analysis showed the CUB was weak in these four species and the four subgenomes of the two tetraploids. Codon composition analysis revealed these four species preferred to use pyrimidine-rich codons more frequently than purine-rich codons. Correlation analysis indicated that the base content at the third position of codons affect the degree of codon preference. PR2-bias plot and ENC-plot analyses revealed that the CUB patterns in these genomes and subgenomes were influenced by combined effects of translational selection, directional mutation and other factors. The translational selection (P2) analysis results, together with the non-significant correlation between GC12 and GC3, further revealed that translational selection played the dominant role over mutation pressure in the codon usage bias. Through relative synonymous codon usage (RSCU) analysis, we detected 25 high frequency codons preferred to end with T or A, and 31 low frequency codons inclined to end with C or G in these four species and four subgenomes. Finally, 19 to 26 optimal codons with 19 common ones were determined for each species and subgenomes, which preferred to end with A or T. We concluded that the codon usage bias was weak and the translation selection was the main shaping factor in nuclear genes of these four cotton genomes and four subgenomes.
url http://europepmc.org/articles/PMC5870960?pdf=render
work_keys_str_mv AT liyuanwang genomewideanalysisofcodonusagebiasinfoursequencedcottonspecies
AT huixianxing genomewideanalysisofcodonusagebiasinfoursequencedcottonspecies
AT yanchaoyuan genomewideanalysisofcodonusagebiasinfoursequencedcottonspecies
AT xianlinwang genomewideanalysisofcodonusagebiasinfoursequencedcottonspecies
AT muhammadsaeed genomewideanalysisofcodonusagebiasinfoursequencedcottonspecies
AT jincaitao genomewideanalysisofcodonusagebiasinfoursequencedcottonspecies
AT weifeng genomewideanalysisofcodonusagebiasinfoursequencedcottonspecies
AT guihuazhang genomewideanalysisofcodonusagebiasinfoursequencedcottonspecies
AT xianliangsong genomewideanalysisofcodonusagebiasinfoursequencedcottonspecies
AT xuezhensun genomewideanalysisofcodonusagebiasinfoursequencedcottonspecies
_version_ 1725895584504610816