PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants

Non-coding RNAs (ncRNAs) constitute an important set of transcripts produced in the cells of organisms. Among them, there is a large amount of a particular class of long ncRNAs that are difficult to predict, the so-called long intergenic ncRNAs (lincRNAs), which might play essential roles in gene re...

Full description

Bibliographic Details
Main Authors: Lucas Maciel Vieira, Clicia Grativol, Flavia Thiebaut, Thais G. Carvalho, Pablo R. Hardoim, Adriana Hemerly, Sergio Lifschitz, Paulo Cavalcanti Gomes Ferreira, Maria Emilia M. T. Walter
Format: Article
Language:English
Published: MDPI AG 2017-03-01
Series:Non-Coding RNA
Subjects:
Online Access:http://www.mdpi.com/2311-553X/3/1/11
id doaj-373e981c620841aeaae9b3a563d0e576
record_format Article
spelling doaj-373e981c620841aeaae9b3a563d0e5762020-11-25T01:29:28ZengMDPI AGNon-Coding RNA2311-553X2017-03-01311110.3390/ncrna3010011ncrna3010011PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in PlantsLucas Maciel Vieira0Clicia Grativol1Flavia Thiebaut2Thais G. Carvalho3Pablo R. Hardoim4Adriana Hemerly5Sergio Lifschitz6Paulo Cavalcanti Gomes Ferreira7Maria Emilia M. T. Walter8Departamento de Ciência da Computação, Universidade de Brasília, Brasília—DF 70910-900, BrasilLaboratório de Química e Função de Proteínas e Peptídeos, Universidade Estadual do Norte Fluminense, Campos dos Goytacazes—RJ 28013-602, BrazilInstituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro—RJ 21941-901, BrazilInstituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro—RJ 21941-901, BrazilInstituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro—RJ 21941-901, BrazilInstituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro—RJ 21941-901, BrazilDepartamento de Informática, Pontifícia Universidade Católica do Rio de Janeiro, Rio de Janeiro—RJ 22451-900, BrazilInstituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro—RJ 21941-901, BrazilDepartamento de Ciência da Computação, Universidade de Brasília, Brasília—DF 70910-900, BrasilNon-coding RNAs (ncRNAs) constitute an important set of transcripts produced in the cells of organisms. Among them, there is a large amount of a particular class of long ncRNAs that are difficult to predict, the so-called long intergenic ncRNAs (lincRNAs), which might play essential roles in gene regulation and other cellular processes. Despite the importance of these lincRNAs, there is still a lack of biological knowledge and, currently, the few computational methods considered are so specific that they cannot be successfully applied to other species different from those that they have been originally designed to. Prediction of lncRNAs have been performed with machine learning techniques. Particularly, for lincRNA prediction, supervised learning methods have been explored in recent literature. As far as we know, there are no methods nor workflows specially designed to predict lincRNAs in plants. In this context, this work proposes a workflow to predict lincRNAs on plants, considering a workflow that includes known bioinformatics tools together with machine learning techniques, here a support vector machine (SVM). We discuss two case studies that allowed to identify novel lincRNAs, in sugarcane (Saccharum spp.) and in maize (Zea mays). From the results, we also could identify differentially-expressed lincRNAs in sugarcane and maize plants submitted to pathogenic and beneficial microorganisms.http://www.mdpi.com/2311-553X/3/1/11long non-coding RNAslong intergenic non-coding RNAsplantssugarcanemaizeSVM-based workflowbioinformatics
collection DOAJ
language English
format Article
sources DOAJ
author Lucas Maciel Vieira
Clicia Grativol
Flavia Thiebaut
Thais G. Carvalho
Pablo R. Hardoim
Adriana Hemerly
Sergio Lifschitz
Paulo Cavalcanti Gomes Ferreira
Maria Emilia M. T. Walter
spellingShingle Lucas Maciel Vieira
Clicia Grativol
Flavia Thiebaut
Thais G. Carvalho
Pablo R. Hardoim
Adriana Hemerly
Sergio Lifschitz
Paulo Cavalcanti Gomes Ferreira
Maria Emilia M. T. Walter
PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants
Non-Coding RNA
long non-coding RNAs
long intergenic non-coding RNAs
plants
sugarcane
maize
SVM-based workflow
bioinformatics
author_facet Lucas Maciel Vieira
Clicia Grativol
Flavia Thiebaut
Thais G. Carvalho
Pablo R. Hardoim
Adriana Hemerly
Sergio Lifschitz
Paulo Cavalcanti Gomes Ferreira
Maria Emilia M. T. Walter
author_sort Lucas Maciel Vieira
title PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants
title_short PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants
title_full PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants
title_fullStr PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants
title_full_unstemmed PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants
title_sort plantrna_sniffer: a svm-based workflow to predict long intergenic non-coding rnas in plants
publisher MDPI AG
series Non-Coding RNA
issn 2311-553X
publishDate 2017-03-01
description Non-coding RNAs (ncRNAs) constitute an important set of transcripts produced in the cells of organisms. Among them, there is a large amount of a particular class of long ncRNAs that are difficult to predict, the so-called long intergenic ncRNAs (lincRNAs), which might play essential roles in gene regulation and other cellular processes. Despite the importance of these lincRNAs, there is still a lack of biological knowledge and, currently, the few computational methods considered are so specific that they cannot be successfully applied to other species different from those that they have been originally designed to. Prediction of lncRNAs have been performed with machine learning techniques. Particularly, for lincRNA prediction, supervised learning methods have been explored in recent literature. As far as we know, there are no methods nor workflows specially designed to predict lincRNAs in plants. In this context, this work proposes a workflow to predict lincRNAs on plants, considering a workflow that includes known bioinformatics tools together with machine learning techniques, here a support vector machine (SVM). We discuss two case studies that allowed to identify novel lincRNAs, in sugarcane (Saccharum spp.) and in maize (Zea mays). From the results, we also could identify differentially-expressed lincRNAs in sugarcane and maize plants submitted to pathogenic and beneficial microorganisms.
topic long non-coding RNAs
long intergenic non-coding RNAs
plants
sugarcane
maize
SVM-based workflow
bioinformatics
url http://www.mdpi.com/2311-553X/3/1/11
work_keys_str_mv AT lucasmacielvieira plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants
AT cliciagrativol plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants
AT flaviathiebaut plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants
AT thaisgcarvalho plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants
AT pablorhardoim plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants
AT adrianahemerly plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants
AT sergiolifschitz plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants
AT paulocavalcantigomesferreira plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants
AT mariaemiliamtwalter plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants
_version_ 1725096945869914112