De novo assembly of leaf transcriptome in the medicinal plant Andrographis paniculata
Andrographis paniculata is an important medicinal plant containing various bioactive terpenoids and flavonoids. Despite its importance in herbal medicine, no ready-to-use transcript sequence information for this plant is available in public databases. This study mainly deals with the sequenc...
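Main Authors: | Neeraja Cherukupalli, Mayur Divate, Suresh Reddy Mittapelli, Venkateswara Rao Khareedu, Dashavantha Reddy Vudem |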
---|---|
Format: | Article |
Language: | English |
Published: | Frontiers Media S.A., 2016-08-01 |
Series: | Frontiers in Plant Science |
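Subjects: | cytochrome P450; Terpenoid biosynthesis; simple sequence repeats; De-novo Assembly; Andrographis paniculata; Leaf transcriptome |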
Online Access: | http://journal.frontiersin.org/Journal/10.3389/fpls.2016.01203/full |
id |
doaj-685e0d7ed55b4a8890e79d0ff0fa1a48 |
---|---|
record_format |
Article |
spelling |
doaj-685e0d7ed55b4a8890e79d0ff0fa1a48; 2020-11-24T22:26:54Z; eng; Frontiers Media S.A.; Frontiers in Plant Science; 1664-462X; 2016-08-01; 7; 10.3389/fpls.2016.01203; 191160; De novo assembly of leaf transcriptome in the medicinal plant Andrographis paniculata; Neeraja Cherukupalli (Osmania University), Mayur Divate (Osmania University), Suresh Reddy Mittapelli (Osmania University), Venkateswara Rao Khareedu (Osmania University), Dashavantha Reddy Vudem (Osmania University); http://journal.frontiersin.org/Journal/10.3389/fpls.2016.01203/full; cytochrome P450; Terpenoid biosynthesis; simple sequence repeats; De-novo Assembly; Andrographis paniculata; Leaf transcriptome |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Neeraja Cherukupalli, Mayur Divate, Suresh Reddy Mittapelli, Venkateswara Rao Khareedu, Dashavantha Reddy Vudem |
title |
De novo assembly of leaf transcriptome in the medicinal plant Andrographis paniculata |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Plant Science |
issn |
1664-462X |
publishDate |
2016-08-01 |
description |
Andrographis paniculata is an important medicinal plant containing various bioactive terpenoids and flavonoids. Despite its importance in herbal medicine, no ready-to-use transcript sequence information for this plant is available in public databases. This study mainly deals with the sequencing of RNA from A. paniculata leaf using the Illumina HiSeq™ 2000 platform, followed by de novo transcriptome assembly. A total of 189.22 million high-quality paired reads were generated, and 170,724 transcripts were predicted in the primary assembly. Secondary assembly generated a transcriptome of ~88 Mb with 83,800 clustered transcripts. Based on similarity searches against the plant non-redundant protein database, Gene Ontology, and eukaryotic orthologous groups, 49,363 transcripts were annotated, constituting up to 58.91% of the identified unigenes. Annotation of transcripts using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database revealed 5,606 transcripts plausibly involved in 140 pathways, including biosynthesis of terpenoids and other secondary metabolites. Transcription factor analysis showed 6,767 unique transcripts belonging to 97 different transcription factor families. A total of 124 CYP450 transcripts belonging to seven divergent clans were identified. The transcriptome revealed 146 different transcripts coding for enzymes involved in the biosynthesis of terpenoids, of which 35 contained terpene synthase motifs. This study also revealed 32,341 simple sequence repeats (SSRs) in 23,168 transcripts. The assembled transcriptome sequences of A. paniculata generated in this study are made available, for the first time, in the TSA database, providing useful information for functional and comparative genomic analyses as well as for identification of key enzymes involved in the various pathways of secondary metabolism. |
topic |
cytochrome P450; Terpenoid biosynthesis; simple sequence repeats; De-novo Assembly; Andrographis paniculata; Leaf transcriptome |
url |
http://journal.frontiersin.org/Journal/10.3389/fpls.2016.01203/full |