wDBTF: an integrated database resource for studying wheat transcription factor families

<p>Abstract</p> <p>Background</p> <p>Transcription factors (TFs) regulate gene expression by interacting with promoters of their target genes and are classified into families based on their DNA-binding domains. Genes coding for TFs have been identified in the sequences...

Full description

Bibliographic Details
Main Authors: Ravel Catherine, Charmet Gilles, Branlard Gérard, Tessier Dominique, Dardevet Mireille, Romeuf Isabelle
Format: Article
Language:English
Published: BMC 2010-03-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/11/185
id doaj-30a508ec4ace4ee9a9b74c8dd0481552
record_format Article
spelling doaj-30a508ec4ace4ee9a9b74c8dd04815522020-11-24T21:19:21ZengBMCBMC Genomics1471-21642010-03-0111118510.1186/1471-2164-11-185wDBTF: an integrated database resource for studying wheat transcription factor familiesRavel CatherineCharmet GillesBranlard GérardTessier DominiqueDardevet MireilleRomeuf Isabelle<p>Abstract</p> <p>Background</p> <p>Transcription factors (TFs) regulate gene expression by interacting with promoters of their target genes and are classified into families based on their DNA-binding domains. Genes coding for TFs have been identified in the sequences of model plant genomes. The rice (<it>Oryza sativa </it>spp. <it>japonica</it>) genome contains 2,384 TF gene models, which represent the mRNA transcript of a locus, classed into 63 families.</p> <p>Results</p> <p>We have created an extensive list of wheat (<it>Triticum aestivum </it>L) TF sequences based on sequence homology with rice TFs identified and classified in the Database of Rice Transcription Factors (DRTF). We have identified 7,112 wheat sequences (contigs and singletons) from a dataset of 1,033,960 expressed sequence tag and mRNA (ET) sequences available. This number is about three times the number of TFs in rice so proportionally is very similar if allowance is made for the hexaploidy of wheat. Of these sequences 3,820 encode gene products with a DNA-binding domain and thus were confirmed as potential regulators. These 3,820 sequences were classified into 40 families and 84 subfamilies and some members defined orphan families. The results were compiled in the Database of Wheat Transcription Factor (wDBTF), an inventory available on the web <url>http://wwwappli.nantes.inra.fr:8180/wDBFT/</url>. For each accession, a link to its library source and its Affymetrix identification number is provided. The positions of Pfam (protein family database) motifs were given when known.</p> <p>Conclusions</p> <p>wDBTF collates 3,820 wheat TF sequences validated by the presence of a DNA-binding domain out of 7,112 potential TF sequences identified from publicly available gene expression data. We also incorporated <it>in silico </it>expression data on these TFs into the database. Thus this database provides a major resource for systematic studies of TF families and their expression in wheat as illustrated here in a study of DOF family members expressed during seed development.</p> http://www.biomedcentral.com/1471-2164/11/185
collection DOAJ
language English
format Article
sources DOAJ
author Ravel Catherine
Charmet Gilles
Branlard Gérard
Tessier Dominique
Dardevet Mireille
Romeuf Isabelle
spellingShingle Ravel Catherine
Charmet Gilles
Branlard Gérard
Tessier Dominique
Dardevet Mireille
Romeuf Isabelle
wDBTF: an integrated database resource for studying wheat transcription factor families
BMC Genomics
author_facet Ravel Catherine
Charmet Gilles
Branlard Gérard
Tessier Dominique
Dardevet Mireille
Romeuf Isabelle
author_sort Ravel Catherine
title wDBTF: an integrated database resource for studying wheat transcription factor families
title_short wDBTF: an integrated database resource for studying wheat transcription factor families
title_full wDBTF: an integrated database resource for studying wheat transcription factor families
title_fullStr wDBTF: an integrated database resource for studying wheat transcription factor families
title_full_unstemmed wDBTF: an integrated database resource for studying wheat transcription factor families
title_sort wdbtf: an integrated database resource for studying wheat transcription factor families
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2010-03-01
description <p>Abstract</p> <p>Background</p> <p>Transcription factors (TFs) regulate gene expression by interacting with promoters of their target genes and are classified into families based on their DNA-binding domains. Genes coding for TFs have been identified in the sequences of model plant genomes. The rice (<it>Oryza sativa </it>spp. <it>japonica</it>) genome contains 2,384 TF gene models, which represent the mRNA transcript of a locus, classed into 63 families.</p> <p>Results</p> <p>We have created an extensive list of wheat (<it>Triticum aestivum </it>L) TF sequences based on sequence homology with rice TFs identified and classified in the Database of Rice Transcription Factors (DRTF). We have identified 7,112 wheat sequences (contigs and singletons) from a dataset of 1,033,960 expressed sequence tag and mRNA (ET) sequences available. This number is about three times the number of TFs in rice so proportionally is very similar if allowance is made for the hexaploidy of wheat. Of these sequences 3,820 encode gene products with a DNA-binding domain and thus were confirmed as potential regulators. These 3,820 sequences were classified into 40 families and 84 subfamilies and some members defined orphan families. The results were compiled in the Database of Wheat Transcription Factor (wDBTF), an inventory available on the web <url>http://wwwappli.nantes.inra.fr:8180/wDBFT/</url>. For each accession, a link to its library source and its Affymetrix identification number is provided. The positions of Pfam (protein family database) motifs were given when known.</p> <p>Conclusions</p> <p>wDBTF collates 3,820 wheat TF sequences validated by the presence of a DNA-binding domain out of 7,112 potential TF sequences identified from publicly available gene expression data. We also incorporated <it>in silico </it>expression data on these TFs into the database. Thus this database provides a major resource for systematic studies of TF families and their expression in wheat as illustrated here in a study of DOF family members expressed during seed development.</p>
url http://www.biomedcentral.com/1471-2164/11/185
work_keys_str_mv AT ravelcatherine wdbtfanintegrateddatabaseresourceforstudyingwheattranscriptionfactorfamilies
AT charmetgilles wdbtfanintegrateddatabaseresourceforstudyingwheattranscriptionfactorfamilies
AT branlardgerard wdbtfanintegrateddatabaseresourceforstudyingwheattranscriptionfactorfamilies
AT tessierdominique wdbtfanintegrateddatabaseresourceforstudyingwheattranscriptionfactorfamilies
AT dardevetmireille wdbtfanintegrateddatabaseresourceforstudyingwheattranscriptionfactorfamilies
AT romeufisabelle wdbtfanintegrateddatabaseresourceforstudyingwheattranscriptionfactorfamilies
_version_ 1726005886497849344