The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers.
BACKGROUND: The safflower, Carthamus tinctorius L., is a worldwide oil crop, and its flowers, which have a high flavonoid content, are an important medicinal resource against cardiovascular disease in traditional medicine. Because the safflower has a large and complex genome, the development of its...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2012-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC3378585?pdf=render |
id |
doaj-e581c8ebcb7049c29742ae5c71084732 |
---|---|
record_format |
Article |
spelling |
doaj-e581c8ebcb7049c29742ae5c710847322020-11-25T01:22:44ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0176e3865310.1371/journal.pone.0038653The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers.Huang LulinYang XiaoSun PeiTong WenHu ShangqinBACKGROUND: The safflower, Carthamus tinctorius L., is a worldwide oil crop, and its flowers, which have a high flavonoid content, are an important medicinal resource against cardiovascular disease in traditional medicine. Because the safflower has a large and complex genome, the development of its genomic resources has been delayed. Second-generation Illumina sequencing is now an efficient route for generating an enormous volume of sequences that can represent a large number of genes and their expression levels. METHODOLOGY/PRINCIPAL FINDINGS: To investigate the genes and pathways that might control flavonoids and other secondary metabolites in the safflower, we used Illumina sequencing to perform a de novo assembly of the safflower tubular flower tissue transcriptome. We obtained a total of 4.69 Gb in clean nucleotides comprising 52,119,104 clean sequencing reads, 195,320 contigs, and 120,778 unigenes. Based on similarity searches with known proteins, we annotated 70,342 of the unigenes (about 58% of the identified unigenes) with cut-off E-values of 10(-5). In total, 21,943 of the safflower unigenes were found to have COG classifications, and BLAST2GO assigned 26,332 of the unigenes to 1,754 GO term annotations. In addition, we assigned 30,203 of the unigenes to 121 KEGG pathways. When we focused on genes identified as contributing to flavonoid biosynthesis and the biosynthesis of unsaturated fatty acids, which are important pathways that control flower and seed quality, respectively, we found that these genes were fairly well conserved in the safflower genome compared to those of other plants. CONCLUSIONS/SIGNIFICANCE: Our study provides abundant genomic data for Carthamus tinctorius L. and offers comprehensive sequence resources for studying the safflower. We believe that these transcriptome datasets will serve as an important public information platform to accelerate studies of the safflower genome, and may help us define the mechanisms of flower tissue-specific and secondary metabolism in this non-model plant.http://europepmc.org/articles/PMC3378585?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Huang Lulin Yang Xiao Sun Pei Tong Wen Hu Shangqin |
spellingShingle |
Huang Lulin Yang Xiao Sun Pei Tong Wen Hu Shangqin The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers. PLoS ONE |
author_facet |
Huang Lulin Yang Xiao Sun Pei Tong Wen Hu Shangqin |
author_sort |
Huang Lulin |
title |
The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers. |
title_short |
The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers. |
title_full |
The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers. |
title_fullStr |
The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers. |
title_full_unstemmed |
The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers. |
title_sort |
first illumina-based de novo transcriptome sequencing and analysis of safflower flowers. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2012-01-01 |
description |
BACKGROUND: The safflower, Carthamus tinctorius L., is a worldwide oil crop, and its flowers, which have a high flavonoid content, are an important medicinal resource against cardiovascular disease in traditional medicine. Because the safflower has a large and complex genome, the development of its genomic resources has been delayed. Second-generation Illumina sequencing is now an efficient route for generating an enormous volume of sequences that can represent a large number of genes and their expression levels. METHODOLOGY/PRINCIPAL FINDINGS: To investigate the genes and pathways that might control flavonoids and other secondary metabolites in the safflower, we used Illumina sequencing to perform a de novo assembly of the safflower tubular flower tissue transcriptome. We obtained a total of 4.69 Gb in clean nucleotides comprising 52,119,104 clean sequencing reads, 195,320 contigs, and 120,778 unigenes. Based on similarity searches with known proteins, we annotated 70,342 of the unigenes (about 58% of the identified unigenes) with cut-off E-values of 10(-5). In total, 21,943 of the safflower unigenes were found to have COG classifications, and BLAST2GO assigned 26,332 of the unigenes to 1,754 GO term annotations. In addition, we assigned 30,203 of the unigenes to 121 KEGG pathways. When we focused on genes identified as contributing to flavonoid biosynthesis and the biosynthesis of unsaturated fatty acids, which are important pathways that control flower and seed quality, respectively, we found that these genes were fairly well conserved in the safflower genome compared to those of other plants. CONCLUSIONS/SIGNIFICANCE: Our study provides abundant genomic data for Carthamus tinctorius L. and offers comprehensive sequence resources for studying the safflower. We believe that these transcriptome datasets will serve as an important public information platform to accelerate studies of the safflower genome, and may help us define the mechanisms of flower tissue-specific and secondary metabolism in this non-model plant. |
url |
http://europepmc.org/articles/PMC3378585?pdf=render |
work_keys_str_mv |
AT huanglulin thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers AT yangxiao thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers AT sunpei thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers AT tongwen thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers AT hushangqin thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers AT huanglulin firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers AT yangxiao firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers AT sunpei firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers AT tongwen firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers AT hushangqin firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers |
_version_ |
1725125718441984000 |