The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers.

BACKGROUND: The safflower, Carthamus tinctorius L., is a worldwide oil crop, and its flowers, which have a high flavonoid content, are an important medicinal resource against cardiovascular disease in traditional medicine. Because the safflower has a large and complex genome, the development of its...

Full description

Bibliographic Details
Main Authors: Huang Lulin, Yang Xiao, Sun Pei, Tong Wen, Hu Shangqin
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2012-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3378585?pdf=render
id doaj-e581c8ebcb7049c29742ae5c71084732
record_format Article
spelling doaj-e581c8ebcb7049c29742ae5c710847322020-11-25T01:22:44ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0176e3865310.1371/journal.pone.0038653The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers.Huang LulinYang XiaoSun PeiTong WenHu ShangqinBACKGROUND: The safflower, Carthamus tinctorius L., is a worldwide oil crop, and its flowers, which have a high flavonoid content, are an important medicinal resource against cardiovascular disease in traditional medicine. Because the safflower has a large and complex genome, the development of its genomic resources has been delayed. Second-generation Illumina sequencing is now an efficient route for generating an enormous volume of sequences that can represent a large number of genes and their expression levels. METHODOLOGY/PRINCIPAL FINDINGS: To investigate the genes and pathways that might control flavonoids and other secondary metabolites in the safflower, we used Illumina sequencing to perform a de novo assembly of the safflower tubular flower tissue transcriptome. We obtained a total of 4.69 Gb in clean nucleotides comprising 52,119,104 clean sequencing reads, 195,320 contigs, and 120,778 unigenes. Based on similarity searches with known proteins, we annotated 70,342 of the unigenes (about 58% of the identified unigenes) with cut-off E-values of 10(-5). In total, 21,943 of the safflower unigenes were found to have COG classifications, and BLAST2GO assigned 26,332 of the unigenes to 1,754 GO term annotations. In addition, we assigned 30,203 of the unigenes to 121 KEGG pathways. When we focused on genes identified as contributing to flavonoid biosynthesis and the biosynthesis of unsaturated fatty acids, which are important pathways that control flower and seed quality, respectively, we found that these genes were fairly well conserved in the safflower genome compared to those of other plants. CONCLUSIONS/SIGNIFICANCE: Our study provides abundant genomic data for Carthamus tinctorius L. and offers comprehensive sequence resources for studying the safflower. We believe that these transcriptome datasets will serve as an important public information platform to accelerate studies of the safflower genome, and may help us define the mechanisms of flower tissue-specific and secondary metabolism in this non-model plant.http://europepmc.org/articles/PMC3378585?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Huang Lulin
Yang Xiao
Sun Pei
Tong Wen
Hu Shangqin
spellingShingle Huang Lulin
Yang Xiao
Sun Pei
Tong Wen
Hu Shangqin
The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers.
PLoS ONE
author_facet Huang Lulin
Yang Xiao
Sun Pei
Tong Wen
Hu Shangqin
author_sort Huang Lulin
title The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers.
title_short The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers.
title_full The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers.
title_fullStr The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers.
title_full_unstemmed The first Illumina-based de novo transcriptome sequencing and analysis of safflower flowers.
title_sort first illumina-based de novo transcriptome sequencing and analysis of safflower flowers.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2012-01-01
description BACKGROUND: The safflower, Carthamus tinctorius L., is a worldwide oil crop, and its flowers, which have a high flavonoid content, are an important medicinal resource against cardiovascular disease in traditional medicine. Because the safflower has a large and complex genome, the development of its genomic resources has been delayed. Second-generation Illumina sequencing is now an efficient route for generating an enormous volume of sequences that can represent a large number of genes and their expression levels. METHODOLOGY/PRINCIPAL FINDINGS: To investigate the genes and pathways that might control flavonoids and other secondary metabolites in the safflower, we used Illumina sequencing to perform a de novo assembly of the safflower tubular flower tissue transcriptome. We obtained a total of 4.69 Gb in clean nucleotides comprising 52,119,104 clean sequencing reads, 195,320 contigs, and 120,778 unigenes. Based on similarity searches with known proteins, we annotated 70,342 of the unigenes (about 58% of the identified unigenes) with cut-off E-values of 10(-5). In total, 21,943 of the safflower unigenes were found to have COG classifications, and BLAST2GO assigned 26,332 of the unigenes to 1,754 GO term annotations. In addition, we assigned 30,203 of the unigenes to 121 KEGG pathways. When we focused on genes identified as contributing to flavonoid biosynthesis and the biosynthesis of unsaturated fatty acids, which are important pathways that control flower and seed quality, respectively, we found that these genes were fairly well conserved in the safflower genome compared to those of other plants. CONCLUSIONS/SIGNIFICANCE: Our study provides abundant genomic data for Carthamus tinctorius L. and offers comprehensive sequence resources for studying the safflower. We believe that these transcriptome datasets will serve as an important public information platform to accelerate studies of the safflower genome, and may help us define the mechanisms of flower tissue-specific and secondary metabolism in this non-model plant.
url http://europepmc.org/articles/PMC3378585?pdf=render
work_keys_str_mv AT huanglulin thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT yangxiao thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT sunpei thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT tongwen thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT hushangqin thefirstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT huanglulin firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT yangxiao firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT sunpei firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT tongwen firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
AT hushangqin firstilluminabaseddenovotranscriptomesequencingandanalysisofsafflowerflowers
_version_ 1725125718441984000