Statistical Analysis of PAR-CLIP data

Bibliographic Details
Main Author: Golumbeanu, Monica
Format: Others
Language: English
Published: KTH, Beräkningsbiologi, CB 2013
Online Access: http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-124347
Description
Summary: From its creation to its degradation, an RNA molecule is the target of many binding proteins with different roles in regulation and RNA metabolism. Since these proteins are involved in a large number of processes, a variety of diseases are related to abnormalities in the binding mechanisms. One experimental method for detecting the binding sites of these proteins is PAR-CLIP, which builds on next-generation sequencing technology. Because of its size and intrinsic noise, PAR-CLIP data requires appropriate pre-processing and thorough statistical analysis. The present work has two main goals: first, to develop a modular pipeline for pre-processing PAR-CLIP data and extracting the signals needed for further analysis; second, to devise a novel statistical model for inferring the presence of protein binding sites from the signals extracted in the pre-processing step.