Cloning, analysis and functional annotation of expressed sequence tags from the Earthworm <it>Eisenia fetida</it>

<p>Abstract</p> <p>Background</p> <p><it>Eisenia fetida</it>, commonly known as red wiggler or compost worm, belongs to the Lumbricidae family of the Annelida phylum. Little is known about its genome sequence although it has been extensively used as a test o...

Full description

Bibliographic Details
Main Authors: Yang Kuan, Inouye Laura S, Guan Xin, Gong Ping, Pirooznia Mehdi, Perkins Edward J, Deng Youping
Format: Article
Language:English
Published: BMC 2007-11-01
Series:BMC Bioinformatics
Description
Summary:<p>Abstract</p> <p>Background</p> <p><it>Eisenia fetida</it>, commonly known as red wiggler or compost worm, belongs to the Lumbricidae family of the Annelida phylum. Little is known about its genome sequence although it has been extensively used as a test organism in terrestrial ecotoxicology. In order to understand its gene expression response to environmental contaminants, we cloned 4032 cDNAs or expressed sequence tags (ESTs) from two <it>E. fetida </it>libraries enriched with genes responsive to ten ordnance related compounds using suppressive subtractive hybridization-PCR.</p> <p>Results</p> <p>A total of 3144 good quality ESTs (GenBank dbEST accession number <ext-link ext-link-type="gen" ext-link-id="EH669363">EH669363</ext-link>–<ext-link ext-link-type="gen" ext-link-id="EH672369">EH672369</ext-link> and <ext-link ext-link-type="gen" ext-link-id="EL515444">EL515444</ext-link>–<ext-link ext-link-type="gen" ext-link-id="EL515580">EL515580</ext-link>) were obtained from the raw clone sequences after cleaning. Clustering analysis yielded 2231 unique sequences including 448 contigs (from 1361 ESTs) and 1783 singletons. Comparative genomic analysis showed that 743 or 33% of the unique sequences shared high similarity with existing genes in the GenBank nr database. Provisional function annotation assigned 830 Gene Ontology terms to 517 unique sequences based on their homology with the annotated genomes of four model organisms <it>Drosophila melanogaster</it>, <it>Mus musculus</it>, <it>Saccharomyces cerevisiae</it>, and <it>Caenorhabditis elegans</it>. Seven percent of the unique sequences were further mapped to 99 Kyoto Encyclopedia of Genes and Genomes pathways based on their matching Enzyme Commission numbers. All the information is stored and retrievable at a highly performed, web-based and user-friendly relational database called EST model database or ESTMD version 2.</p> <p>Conclusion</p> <p>The ESTMD containing the sequence and annotation information of 4032 <it>E. fetida </it>ESTs is publicly accessible at <url>http://mcbc.usm.edu/estmd/</url>.</p>
ISSN:1471-2105