De novo assembly of leaf transcriptome in the medicinal plant Andrographis paniculata

Andrographis paniculata is an important medicinal plant containing various bioactive terpenoids and flavonoids. Despite its importance in herbal medicine, no ready-to-use transcript sequence information of this plant is made available in the public data base, this study mainly deals with the sequenc...

Full description

Bibliographic Details
Main Authors: Neeraja Cherukupalli, Mayur Divate, Suresh Reddy Mittapelli, Venkateswara Rao Khareedu, Dashavantha Reddy Vudem
Format: Article
Language:English
Published: Frontiers Media S.A. 2016-08-01
Series:Frontiers in Plant Science
Subjects:
Online Access:http://journal.frontiersin.org/Journal/10.3389/fpls.2016.01203/full
id doaj-685e0d7ed55b4a8890e79d0ff0fa1a48
record_format Article
spelling doaj-685e0d7ed55b4a8890e79d0ff0fa1a482020-11-24T22:26:54ZengFrontiers Media S.A.Frontiers in Plant Science1664-462X2016-08-01710.3389/fpls.2016.01203191160De novo assembly of leaf transcriptome in the medicinal plant Andrographis paniculataNeeraja Cherukupalli0Mayur Divate1Suresh Reddy Mittapelli2Venkateswara Rao Khareedu3Dashavantha Reddy Vudem4Osmania UniversityOsmania UniversityOsmania UniversityOsmania UniversityOsmania UniversityAndrographis paniculata is an important medicinal plant containing various bioactive terpenoids and flavonoids. Despite its importance in herbal medicine, no ready-to-use transcript sequence information of this plant is made available in the public data base, this study mainly deals with the sequencing of RNA from A. paniculata leaf using Illumina HiSeqTM 2000 platform followed by the de novo transcriptome assembly. A total of 189.22 million high quality paired reads were generated and 1,70,724 transcripts were predicted in the primary assembly. Secondary assembly generated a transcriptome size of ~88 Mb with 83,800 clustered transcripts. Based on the similarity searches against plant nonredundant protein database, gene ontology and eukaryotic orthologous groups, 49,363 transcripts were annotated constituting upto 58.91% of the identified unigenes. Annotation of transcripts − using kyoto encyclopedia of genes and genomes database − revealed 5,606 transcripts plausibly involved in 140 pathways including biosynthesis of terpenoids and other secondary metabolites. Transcription factor analysis showed 6,767 unique transcripts belonging to 97 different transcription factor families. A total number of 124 CYP450 transcripts belonging to seven divergent clans have been identified. Transcriptome revealed 146 different transcripts coding for enzymes involved in the biosynthesis of terpenoids of which 35 contained terpene synthase motifs. This study also revealed 32,341 simple sequence repeats (SSRs) in 23,168 transcripts. Assembled sequences of transcriptome of A.paniculata generated in this study are made available, for the first time, in the TSA database, which provides useful information for functional and comparative genomic analyses besides identification of key enzymes involved in the various pathways of secondary metabolism.http://journal.frontiersin.org/Journal/10.3389/fpls.2016.01203/fullcytochrome P450Terpenoid biosynthesissimple sequence repeatsDe-novo AssemblyAndrographis paniculataLeaf transcriptome
collection DOAJ
language English
format Article
sources DOAJ
author Neeraja Cherukupalli
Mayur Divate
Suresh Reddy Mittapelli
Venkateswara Rao Khareedu
Dashavantha Reddy Vudem
spellingShingle Neeraja Cherukupalli
Mayur Divate
Suresh Reddy Mittapelli
Venkateswara Rao Khareedu
Dashavantha Reddy Vudem
De novo assembly of leaf transcriptome in the medicinal plant Andrographis paniculata
Frontiers in Plant Science
cytochrome P450
Terpenoid biosynthesis
simple sequence repeats
De-novo Assembly
Andrographis paniculata
Leaf transcriptome
author_facet Neeraja Cherukupalli
Mayur Divate
Suresh Reddy Mittapelli
Venkateswara Rao Khareedu
Dashavantha Reddy Vudem
author_sort Neeraja Cherukupalli
title De novo assembly of leaf transcriptome in the medicinal plant Andrographis paniculata
title_short De novo assembly of leaf transcriptome in the medicinal plant Andrographis paniculata
title_full De novo assembly of leaf transcriptome in the medicinal plant Andrographis paniculata
title_fullStr De novo assembly of leaf transcriptome in the medicinal plant Andrographis paniculata
title_full_unstemmed De novo assembly of leaf transcriptome in the medicinal plant Andrographis paniculata
title_sort de novo assembly of leaf transcriptome in the medicinal plant andrographis paniculata
publisher Frontiers Media S.A.
series Frontiers in Plant Science
issn 1664-462X
publishDate 2016-08-01
description Andrographis paniculata is an important medicinal plant containing various bioactive terpenoids and flavonoids. Despite its importance in herbal medicine, no ready-to-use transcript sequence information of this plant is made available in the public data base, this study mainly deals with the sequencing of RNA from A. paniculata leaf using Illumina HiSeqTM 2000 platform followed by the de novo transcriptome assembly. A total of 189.22 million high quality paired reads were generated and 1,70,724 transcripts were predicted in the primary assembly. Secondary assembly generated a transcriptome size of ~88 Mb with 83,800 clustered transcripts. Based on the similarity searches against plant nonredundant protein database, gene ontology and eukaryotic orthologous groups, 49,363 transcripts were annotated constituting upto 58.91% of the identified unigenes. Annotation of transcripts − using kyoto encyclopedia of genes and genomes database − revealed 5,606 transcripts plausibly involved in 140 pathways including biosynthesis of terpenoids and other secondary metabolites. Transcription factor analysis showed 6,767 unique transcripts belonging to 97 different transcription factor families. A total number of 124 CYP450 transcripts belonging to seven divergent clans have been identified. Transcriptome revealed 146 different transcripts coding for enzymes involved in the biosynthesis of terpenoids of which 35 contained terpene synthase motifs. This study also revealed 32,341 simple sequence repeats (SSRs) in 23,168 transcripts. Assembled sequences of transcriptome of A.paniculata generated in this study are made available, for the first time, in the TSA database, which provides useful information for functional and comparative genomic analyses besides identification of key enzymes involved in the various pathways of secondary metabolism.
topic cytochrome P450
Terpenoid biosynthesis
simple sequence repeats
De-novo Assembly
Andrographis paniculata
Leaf transcriptome
url http://journal.frontiersin.org/Journal/10.3389/fpls.2016.01203/full
work_keys_str_mv AT neerajacherukupalli denovoassemblyofleaftranscriptomeinthemedicinalplantandrographispaniculata
AT mayurdivate denovoassemblyofleaftranscriptomeinthemedicinalplantandrographispaniculata
AT sureshreddymittapelli denovoassemblyofleaftranscriptomeinthemedicinalplantandrographispaniculata
AT venkateswararaokhareedu denovoassemblyofleaftranscriptomeinthemedicinalplantandrographispaniculata
AT dashavanthareddyvudem denovoassemblyofleaftranscriptomeinthemedicinalplantandrographispaniculata
_version_ 1725751202738601984