Computational prediction of miRNA genes from small RNA sequencing data

Next-generation sequencing now for the first time allows researchers to gauge the depth and variation of entire transcriptomes. However, now as rare transcripts can be detected that are present in cells at single copies, more advanced computational tools are needed to accurately annotate and profile...

Full description

Bibliographic Details
Main Authors: Wenjing eKang, Marc Riemer Friedländer
Format: Article
Language:English
Published: Frontiers Media S.A. 2015-01-01
Series:Frontiers in Bioengineering and Biotechnology
Subjects:
Online Access:http://journal.frontiersin.org/Journal/10.3389/fbioe.2015.00007/full
id doaj-bf5bca5a8bf248dab13764a71efebfb9
record_format Article
spelling doaj-bf5bca5a8bf248dab13764a71efebfb92020-11-24T21:21:14ZengFrontiers Media S.A.Frontiers in Bioengineering and Biotechnology2296-41852015-01-01310.3389/fbioe.2015.00007122792Computational prediction of miRNA genes from small RNA sequencing dataWenjing eKang0Marc Riemer Friedländer1Science for Life Laboratory / Stockholm UniversityScience for Life Laboratory / Stockholm UniversityNext-generation sequencing now for the first time allows researchers to gauge the depth and variation of entire transcriptomes. However, now as rare transcripts can be detected that are present in cells at single copies, more advanced computational tools are needed to accurately annotate and profile them. miRNAs are 22 nucleotide small RNAs (sRNAs) that post-transcriptionally reduce the output of protein coding genes. They have established roles in numerous biological processes, including cancers and other diseases. During miRNA biogenesis, the sRNAs are sequentially cleaved from precursor molecules that have a characteristic hairpin RNA structure. The vast majority of new miRNA genes that are discovered are mined from small RNA sequencing (sRNA-seq), which can detect more than a billion RNAs in a single run. However, given that many of the detected RNAs are degradation products from all types of transcripts, the accurate identification of miRNAs remain a non-trivial computational problem. Here we review the tools available to predict animal miRNAs from sRNA sequencing data. We present tools for generalist and specialist use cases, including prediction from massively pooled data or in species without reference genome. We also present wet-lab methods used to validate predicted miRNAs, and approaches to computationally benchmark prediction accuracy. For each tool, we reference validation experiments and benchmarking efforts. Last, we discuss the future of the field.http://journal.frontiersin.org/Journal/10.3389/fbioe.2015.00007/fullmicroRNAmiRNAgene predictionnext-generation sequencing dataSmall RNA sequencing
collection DOAJ
language English
format Article
sources DOAJ
author Wenjing eKang
Marc Riemer Friedländer
spellingShingle Wenjing eKang
Marc Riemer Friedländer
Computational prediction of miRNA genes from small RNA sequencing data
Frontiers in Bioengineering and Biotechnology
microRNA
miRNA
gene prediction
next-generation sequencing data
Small RNA sequencing
author_facet Wenjing eKang
Marc Riemer Friedländer
author_sort Wenjing eKang
title Computational prediction of miRNA genes from small RNA sequencing data
title_short Computational prediction of miRNA genes from small RNA sequencing data
title_full Computational prediction of miRNA genes from small RNA sequencing data
title_fullStr Computational prediction of miRNA genes from small RNA sequencing data
title_full_unstemmed Computational prediction of miRNA genes from small RNA sequencing data
title_sort computational prediction of mirna genes from small rna sequencing data
publisher Frontiers Media S.A.
series Frontiers in Bioengineering and Biotechnology
issn 2296-4185
publishDate 2015-01-01
description Next-generation sequencing now for the first time allows researchers to gauge the depth and variation of entire transcriptomes. However, now as rare transcripts can be detected that are present in cells at single copies, more advanced computational tools are needed to accurately annotate and profile them. miRNAs are 22 nucleotide small RNAs (sRNAs) that post-transcriptionally reduce the output of protein coding genes. They have established roles in numerous biological processes, including cancers and other diseases. During miRNA biogenesis, the sRNAs are sequentially cleaved from precursor molecules that have a characteristic hairpin RNA structure. The vast majority of new miRNA genes that are discovered are mined from small RNA sequencing (sRNA-seq), which can detect more than a billion RNAs in a single run. However, given that many of the detected RNAs are degradation products from all types of transcripts, the accurate identification of miRNAs remain a non-trivial computational problem. Here we review the tools available to predict animal miRNAs from sRNA sequencing data. We present tools for generalist and specialist use cases, including prediction from massively pooled data or in species without reference genome. We also present wet-lab methods used to validate predicted miRNAs, and approaches to computationally benchmark prediction accuracy. For each tool, we reference validation experiments and benchmarking efforts. Last, we discuss the future of the field.
topic microRNA
miRNA
gene prediction
next-generation sequencing data
Small RNA sequencing
url http://journal.frontiersin.org/Journal/10.3389/fbioe.2015.00007/full
work_keys_str_mv AT wenjingekang computationalpredictionofmirnagenesfromsmallrnasequencingdata
AT marcriemerfriedlander computationalpredictionofmirnagenesfromsmallrnasequencingdata
_version_ 1726000352186073088