Plant Co-expression Annotation Resource: a web server for identifying targets for genetically modified crop breeding pipelines
Abstract The development of genetically modified crops (GM) includes the discovery of candidate genes through bioinformatics analysis using genomics data, gene expression, and others. Proteins of unknown function (PUFs) are interesting targets for GM crops breeding pipelines for the novelty associat...
Main Authors: | Marcos José Andrade Viana, Adhemar Zerlotini, Mauricio de Alvarenga Mudadu |
---|---|
Format: | Article |
Language: | English |
Published: | BMC 2021-02-01 |
Series: | BMC Bioinformatics |
Subjects: | Proteins of unknown function; Annotation; Abiotic stress; Database |
Online Access: | https://doi.org/10.1186/s12859-020-03792-z |
id |
doaj-a4628b478e804e9dbca01cbba4a086e9 |
---|---|
record_format |
Article |
spelling |
doaj-a4628b478e804e9dbca01cbba4a086e9; 2021-02-07T12:52:26Z; eng; BMC; BMC Bioinformatics; 1471-2105; 2021-02-01; 22; 1; 1; 14; 10.1186/s12859-020-03792-z; Plant Co-expression Annotation Resource: a web server for identifying targets for genetically modified crop breeding pipelines; Marcos José Andrade Viana (Graduate Program in Bioinformatics, Institute of Biological Sciences, Universidade Federal de Minas Gerais); Adhemar Zerlotini (Embrapa Informática Agropecuária); Mauricio de Alvarenga Mudadu (Embrapa Informática Agropecuária); https://doi.org/10.1186/s12859-020-03792-z; Proteins of unknown function; Annotation; Abiotic stress; Database |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Marcos José Andrade Viana; Adhemar Zerlotini; Mauricio de Alvarenga Mudadu |
title |
Plant Co-expression Annotation Resource: a web server for identifying targets for genetically modified crop breeding pipelines |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2021-02-01 |
description |
Abstract The development of genetically modified (GM) crops includes the discovery of candidate genes through bioinformatics analysis of genomics data, gene expression data, and other sources. Proteins of unknown function (PUFs) are interesting targets for GM crop breeding pipelines, both for the novelty associated with such targets and to avoid copyright protection. One method of inferring the putative function of PUFs is to relate them to factors of interest, such as abiotic stresses, using orthology and co-expression networks in a guilt-by-association manner. To this end, we downloaded, analyzed, and processed genomics data of 53 angiosperms, totaling 1,862,010 genes and 2,332,974 RNAs. DIAMOND and InterProScan were used to discover 72,266 PUFs across all organisms. RNA-seq datasets related to abiotic stresses were downloaded from NCBI/GEO and used as input to the LSTrAP software to construct co-expression networks. LSTrAP also created clusters of transcripts with correlated expression, whose members are more likely to be related to the molecular mechanisms associated with abiotic stresses in plants. Orthologous groups were created with OrthoMCL using all 2,332,974 proteins in order to associate PUFs with abiotic stress-related co-expression clusters and thereby infer their function in a guilt-by-association manner. A freely available web resource named “Plant Co-expression Annotation Resource” (Plantannot; https://www.machado.cnptia.embrapa.br/plantannot ) was created to provide indexed queries to search for PUFs putatively associated with abiotic stresses. The web interface also allows browsing, querying, and retrieval of public genomics data from 53 plants. We hope Plantannot will be useful for researchers trying to obtain novel GM crops resistant to climate change hazards. |
topic |
Proteins of unknown function; Annotation; Abiotic stress; Database |
url |
https://doi.org/10.1186/s12859-020-03792-z |
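The guilt-by-association step described in the abstract (linking PUFs to abiotic stress-related co-expression clusters through shared orthologous groups) can be illustrated with a minimal sketch. This is not the authors' pipeline and does not call OrthoMCL or LSTrAP; the function name `infer_puf_associations`, the dictionary layouts, and all protein, group, and cluster identifiers below are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def infer_puf_associations(orthogroups, stress_clusters, pufs):
    """Map each PUF to the stress-related clusters reachable via its orthogroup.

    orthogroups:     dict of orthogroup id -> set of protein ids (assumed layout)
    stress_clusters: dict of cluster id -> set of protein ids (assumed layout)
    pufs:            iterable of protein ids with no known function
    """
    # Index each protein by the orthogroup it belongs to.
    protein_to_group = {
        protein: group
        for group, members in orthogroups.items()
        for protein in members
    }

    # Record which stress-related clusters contain a member of each orthogroup.
    group_to_clusters = defaultdict(set)
    for cluster_id, members in stress_clusters.items():
        for protein in members:
            group = protein_to_group.get(protein)
            if group is not None:
                group_to_clusters[group].add(cluster_id)

    # A PUF inherits the clusters of its orthogroup (guilt by association).
    associations = {}
    for puf in pufs:
        group = protein_to_group.get(puf)
        clusters = group_to_clusters.get(group, set())
        if clusters:
            associations[puf] = sorted(clusters)
    return associations


# Toy data: all identifiers are made up for this example.
orthogroups = {
    "OG1": {"riceA", "maizeB"},
    "OG2": {"riceC", "soyD"},
    "OG3": {"soyE"},
}
stress_clusters = {
    "drought_c1": {"maizeB", "riceX"},
    "salt_c2": {"riceA"},
}
pufs = {"riceA", "riceC", "soyE"}

result = infer_puf_associations(orthogroups, stress_clusters, pufs)
print(result)
```

In this toy input only `riceA` receives an annotation: it sits in a salt-related cluster itself and shares an orthogroup with `maizeB`, which sits in a drought-related cluster. PUFs whose orthogroup touches no stress-related cluster are left unannotated rather than guessed.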