Improving the prediction of RNA secondary structure and automatic alignment of RNa sequences

The accurate prediction of an RNA secondary structure from its sequence will enhance the experimental design and interpretation for the increasing number of scientists that study RNA. While the computer programs that make these predictions have improved, additional improvements are necessary, in par...

Full description

Bibliographic Details
Main Author: Gardner, David Paul
Format: Others
Language:English
Published: 2012
Subjects:
Online Access:http://hdl.handle.net/2152/ETD-UT-2012-05-5639
id ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-ETD-UT-2012-05-5639
record_format oai_dc
spelling ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-ETD-UT-2012-05-56392015-09-20T17:06:53ZImproving the prediction of RNA secondary structure and automatic alignment of RNa sequencesGardner, David PaulRNA structureRNA alignmentRNA foldingComparative analysisThe accurate prediction of an RNA secondary structure from its sequence will enhance the experimental design and interpretation for the increasing number of scientists that study RNA. While the computer programs that make these predictions have improved, additional improvements are necessary, in particular for larger RNAs. The first major section of this dissertation is concerned with improving the prediction accuracy of RNA secondary structures by generating new energetic parameters and evaluating a new RNA folding model. Statistical potentials for hairpin and internal loops produce significantly higher prediction accuracy when compared with nine other folding programs. While more improvements can be made to the energetic parameters used by secondary structure folding programs, I believe that a new approach is also necessary. I describe a RNA folding model that is predicated on a large body of computational and experimental work. This model includes energetics, contact distance, competition and a folding pathway. Each component of this folding model is evaluated and substantiated for its validity. The statistical potentials were created with comparative analysis. Comparative analysis requires the creation of highly accurate multiple RNA sequence alignments. The second major section of this dissertation is focused on my template-based sequence aligner, CRWAlign. Multiple sequence aligners generally run into problems when the pairwise sequence identity drops too low. By utilizing multiple dimensions of data to establish a profile for each position in a template alignment, CRWAlign is able to align new sequences with high accuracy even for pairs of sequence with low identity.text2012-07-02T22:21:00Z2012-07-02T22:21:00Z2012-052012-07-02May 20122012-07-02T22:21:16Zthesisapplication/pdfhttp://hdl.handle.net/2152/ETD-UT-2012-05-56392152/ETD-UT-2012-05-5639eng
collection NDLTD
language English
format Others
sources NDLTD
topic RNA structure
RNA alignment
RNA folding
Comparative analysis
spellingShingle RNA structure
RNA alignment
RNA folding
Comparative analysis
Gardner, David Paul
Improving the prediction of RNA secondary structure and automatic alignment of RNa sequences
description The accurate prediction of an RNA secondary structure from its sequence will enhance the experimental design and interpretation for the increasing number of scientists that study RNA. While the computer programs that make these predictions have improved, additional improvements are necessary, in particular for larger RNAs. The first major section of this dissertation is concerned with improving the prediction accuracy of RNA secondary structures by generating new energetic parameters and evaluating a new RNA folding model. Statistical potentials for hairpin and internal loops produce significantly higher prediction accuracy when compared with nine other folding programs. While more improvements can be made to the energetic parameters used by secondary structure folding programs, I believe that a new approach is also necessary. I describe a RNA folding model that is predicated on a large body of computational and experimental work. This model includes energetics, contact distance, competition and a folding pathway. Each component of this folding model is evaluated and substantiated for its validity. The statistical potentials were created with comparative analysis. Comparative analysis requires the creation of highly accurate multiple RNA sequence alignments. The second major section of this dissertation is focused on my template-based sequence aligner, CRWAlign. Multiple sequence aligners generally run into problems when the pairwise sequence identity drops too low. By utilizing multiple dimensions of data to establish a profile for each position in a template alignment, CRWAlign is able to align new sequences with high accuracy even for pairs of sequence with low identity. === text
author Gardner, David Paul
author_facet Gardner, David Paul
author_sort Gardner, David Paul
title Improving the prediction of RNA secondary structure and automatic alignment of RNa sequences
title_short Improving the prediction of RNA secondary structure and automatic alignment of RNa sequences
title_full Improving the prediction of RNA secondary structure and automatic alignment of RNa sequences
title_fullStr Improving the prediction of RNA secondary structure and automatic alignment of RNa sequences
title_full_unstemmed Improving the prediction of RNA secondary structure and automatic alignment of RNa sequences
title_sort improving the prediction of rna secondary structure and automatic alignment of rna sequences
publishDate 2012
url http://hdl.handle.net/2152/ETD-UT-2012-05-5639
work_keys_str_mv AT gardnerdavidpaul improvingthepredictionofrnasecondarystructureandautomaticalignmentofrnasequences
_version_ 1716822561497546752