Separating pseudo-microRNAs from true microRNAs

MicroRNAs are small RNA molecules that regulate gene expression in cells. They are derived from hairpin-shaped RNA transcripts, and about 50% of microRNA genes are localized in genomic regions that are associated with cancer. There are numerous other naturally occurring RNA molecules that also take...

Bibliographic Details
Main Author: Holst, Frederik Klokk
Format: Others
Language: English
Published: Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap 2013
Online Access: http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-23358
id ndltd-UPSALLA1-oai-DiVA.org-ntnu-23358
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-ntnu-23358; 2013-11-02T04:45:32Z; Separating pseudo-microRNAs from true microRNAs; eng; Holst, Frederik Klokk; Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap; Institutt for datateknikk og informasjonsvitenskap; 2013; Student thesis; info:eu-repo/semantics/bachelorThesis; text; http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-23358; Local ntnudaim:9936; application/pdf; info:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
description MicroRNAs are small RNA molecules that regulate gene expression in cells. They are derived from hairpin-shaped RNA transcripts, and about 50% of microRNA genes are localized in genomic regions that are associated with cancer. Numerous other naturally occurring RNA molecules also take the shape of hairpins. Being able to distinguish between these molecules and real microRNAs is vital to understanding the nature of microRNAs. The goal of this thesis has been to construct a classifier that, based on existing features, is able to predict whether a hairpin-shaped RNA molecule is a microRNA or a pseudo-microRNA. In addition, the features in use have been analyzed to see which of them are the most important for Microprocessor processing and for microRNA classification. I present a classifier that is able to distinguish between real and pseudo-microRNAs with high certainty for Mus musculus microRNAs. This classifier is based on feature information constructed from the output of another classifier that predicts the Microprocessor cut site of microRNAs. The features used by this cut-site classifier have been analyzed using feature elimination. The results indicate that there are specific positions within the flanking regions of a microRNA substrate that are important for Drosha recognition of the substrate. Feature analysis has also been performed for the microRNA classifier, and the findings indicate that microRNAs can be distinguished from other hairpin RNAs by the fact that microRNAs have one clear cut site candidate, whereas the other hairpin-shaped RNAs might have many possible candidates. This information will hopefully assist the search for novel microRNAs and help reanalyze existing microRNAs to verify that they are in fact microRNAs.
author Holst, Frederik Klokk
title Separating pseudo-microRNAs from true microRNAs
publisher Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap
publishDate 2013
url http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-23358