PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants
Non-coding RNAs (ncRNAs) constitute an important set of transcripts produced in the cells of organisms. Among them, there is a large amount of a particular class of long ncRNAs that are difficult to predict, the so-called long intergenic ncRNAs (lincRNAs), which might play essential roles in gene re...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2017-03-01
|
Series: | Non-Coding RNA |
Subjects: | |
Online Access: | http://www.mdpi.com/2311-553X/3/1/11 |
id |
doaj-373e981c620841aeaae9b3a563d0e576 |
---|---|
record_format |
Article |
spelling |
doaj-373e981c620841aeaae9b3a563d0e5762020-11-25T01:29:28ZengMDPI AGNon-Coding RNA2311-553X2017-03-01311110.3390/ncrna3010011ncrna3010011PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in PlantsLucas Maciel Vieira0Clicia Grativol1Flavia Thiebaut2Thais G. Carvalho3Pablo R. Hardoim4Adriana Hemerly5Sergio Lifschitz6Paulo Cavalcanti Gomes Ferreira7Maria Emilia M. T. Walter8Departamento de Ciência da Computação, Universidade de Brasília, Brasília—DF 70910-900, BrasilLaboratório de Química e Função de Proteínas e Peptídeos, Universidade Estadual do Norte Fluminense, Campos dos Goytacazes—RJ 28013-602, BrazilInstituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro—RJ 21941-901, BrazilInstituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro—RJ 21941-901, BrazilInstituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro—RJ 21941-901, BrazilInstituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro—RJ 21941-901, BrazilDepartamento de Informática, Pontifícia Universidade Católica do Rio de Janeiro, Rio de Janeiro—RJ 22451-900, BrazilInstituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro—RJ 21941-901, BrazilDepartamento de Ciência da Computação, Universidade de Brasília, Brasília—DF 70910-900, BrasilNon-coding RNAs (ncRNAs) constitute an important set of transcripts produced in the cells of organisms. Among them, there is a large amount of a particular class of long ncRNAs that are difficult to predict, the so-called long intergenic ncRNAs (lincRNAs), which might play essential roles in gene regulation and other cellular processes. Despite the importance of these lincRNAs, there is still a lack of biological knowledge and, currently, the few computational methods considered are so specific that they cannot be successfully applied to other species different from those that they have been originally designed to. Prediction of lncRNAs have been performed with machine learning techniques. Particularly, for lincRNA prediction, supervised learning methods have been explored in recent literature. As far as we know, there are no methods nor workflows specially designed to predict lincRNAs in plants. In this context, this work proposes a workflow to predict lincRNAs on plants, considering a workflow that includes known bioinformatics tools together with machine learning techniques, here a support vector machine (SVM). We discuss two case studies that allowed to identify novel lincRNAs, in sugarcane (Saccharum spp.) and in maize (Zea mays). From the results, we also could identify differentially-expressed lincRNAs in sugarcane and maize plants submitted to pathogenic and beneficial microorganisms.http://www.mdpi.com/2311-553X/3/1/11long non-coding RNAslong intergenic non-coding RNAsplantssugarcanemaizeSVM-based workflowbioinformatics |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Lucas Maciel Vieira Clicia Grativol Flavia Thiebaut Thais G. Carvalho Pablo R. Hardoim Adriana Hemerly Sergio Lifschitz Paulo Cavalcanti Gomes Ferreira Maria Emilia M. T. Walter |
spellingShingle |
Lucas Maciel Vieira Clicia Grativol Flavia Thiebaut Thais G. Carvalho Pablo R. Hardoim Adriana Hemerly Sergio Lifschitz Paulo Cavalcanti Gomes Ferreira Maria Emilia M. T. Walter PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants Non-Coding RNA long non-coding RNAs long intergenic non-coding RNAs plants sugarcane maize SVM-based workflow bioinformatics |
author_facet |
Lucas Maciel Vieira Clicia Grativol Flavia Thiebaut Thais G. Carvalho Pablo R. Hardoim Adriana Hemerly Sergio Lifschitz Paulo Cavalcanti Gomes Ferreira Maria Emilia M. T. Walter |
author_sort |
Lucas Maciel Vieira |
title |
PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants |
title_short |
PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants |
title_full |
PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants |
title_fullStr |
PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants |
title_full_unstemmed |
PlantRNA_Sniffer: A SVM-Based Workflow to Predict Long Intergenic Non-Coding RNAs in Plants |
title_sort |
plantrna_sniffer: a svm-based workflow to predict long intergenic non-coding rnas in plants |
publisher |
MDPI AG |
series |
Non-Coding RNA |
issn |
2311-553X |
publishDate |
2017-03-01 |
description |
Non-coding RNAs (ncRNAs) constitute an important set of transcripts produced in the cells of organisms. Among them, there is a large amount of a particular class of long ncRNAs that are difficult to predict, the so-called long intergenic ncRNAs (lincRNAs), which might play essential roles in gene regulation and other cellular processes. Despite the importance of these lincRNAs, there is still a lack of biological knowledge and, currently, the few computational methods considered are so specific that they cannot be successfully applied to other species different from those that they have been originally designed to. Prediction of lncRNAs have been performed with machine learning techniques. Particularly, for lincRNA prediction, supervised learning methods have been explored in recent literature. As far as we know, there are no methods nor workflows specially designed to predict lincRNAs in plants. In this context, this work proposes a workflow to predict lincRNAs on plants, considering a workflow that includes known bioinformatics tools together with machine learning techniques, here a support vector machine (SVM). We discuss two case studies that allowed to identify novel lincRNAs, in sugarcane (Saccharum spp.) and in maize (Zea mays). From the results, we also could identify differentially-expressed lincRNAs in sugarcane and maize plants submitted to pathogenic and beneficial microorganisms. |
topic |
long non-coding RNAs long intergenic non-coding RNAs plants sugarcane maize SVM-based workflow bioinformatics |
url |
http://www.mdpi.com/2311-553X/3/1/11 |
work_keys_str_mv |
AT lucasmacielvieira plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants AT cliciagrativol plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants AT flaviathiebaut plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants AT thaisgcarvalho plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants AT pablorhardoim plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants AT adrianahemerly plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants AT sergiolifschitz plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants AT paulocavalcantigomesferreira plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants AT mariaemiliamtwalter plantrnasnifferasvmbasedworkflowtopredictlongintergenicnoncodingrnasinplants |
_version_ |
1725096945869914112 |