Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs.

Prediction of RNA tertiary structure from sequence is an important problem, but generating accurate structure models for even short sequences remains difficult. Predictions of RNA tertiary structure tend to be least accurate in loop regions, where non-canonical pairs are important for determining th...

Full description

Bibliographic Details
Main Authors: Michael F Sloma, David H Mathews
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-11-01
Series:PLoS Computational Biology
Online Access:http://europepmc.org/articles/PMC5690697?pdf=render
id doaj-604ebd51417a4f2ca5a5f03bbd34025f
record_format Article
spelling doaj-604ebd51417a4f2ca5a5f03bbd34025f2020-11-25T01:18:25ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582017-11-011311e100582710.1371/journal.pcbi.1005827Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs.Michael F SlomaDavid H MathewsPrediction of RNA tertiary structure from sequence is an important problem, but generating accurate structure models for even short sequences remains difficult. Predictions of RNA tertiary structure tend to be least accurate in loop regions, where non-canonical pairs are important for determining the details of structure. Non-canonical pairs can be predicted using a knowledge-based model of structure that scores nucleotide cyclic motifs, or NCMs. In this work, a partition function algorithm is introduced that allows the estimation of base pairing probabilities for both canonical and non-canonical interactions. Pairs that are predicted to be probable are more likely to be found in the true structure than pairs of lower probability. Pair probability estimates can be further improved by predicting the structure conserved across multiple homologous sequences using the TurboFold algorithm. These pairing probabilities, used in concert with prior knowledge of the canonical secondary structure, allow accurate inference of non-canonical pairs, an important step towards accurate prediction of the full tertiary structure. Software to predict non-canonical base pairs and pairing probabilities is now provided as part of the RNAstructure software package.http://europepmc.org/articles/PMC5690697?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Michael F Sloma
David H Mathews
spellingShingle Michael F Sloma
David H Mathews
Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs.
PLoS Computational Biology
author_facet Michael F Sloma
David H Mathews
author_sort Michael F Sloma
title Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs.
title_short Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs.
title_full Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs.
title_fullStr Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs.
title_full_unstemmed Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs.
title_sort base pair probability estimates improve the prediction accuracy of rna non-canonical base pairs.
publisher Public Library of Science (PLoS)
series PLoS Computational Biology
issn 1553-734X
1553-7358
publishDate 2017-11-01
description Prediction of RNA tertiary structure from sequence is an important problem, but generating accurate structure models for even short sequences remains difficult. Predictions of RNA tertiary structure tend to be least accurate in loop regions, where non-canonical pairs are important for determining the details of structure. Non-canonical pairs can be predicted using a knowledge-based model of structure that scores nucleotide cyclic motifs, or NCMs. In this work, a partition function algorithm is introduced that allows the estimation of base pairing probabilities for both canonical and non-canonical interactions. Pairs that are predicted to be probable are more likely to be found in the true structure than pairs of lower probability. Pair probability estimates can be further improved by predicting the structure conserved across multiple homologous sequences using the TurboFold algorithm. These pairing probabilities, used in concert with prior knowledge of the canonical secondary structure, allow accurate inference of non-canonical pairs, an important step towards accurate prediction of the full tertiary structure. Software to predict non-canonical base pairs and pairing probabilities is now provided as part of the RNAstructure software package.
url http://europepmc.org/articles/PMC5690697?pdf=render
work_keys_str_mv AT michaelfsloma basepairprobabilityestimatesimprovethepredictionaccuracyofrnanoncanonicalbasepairs
AT davidhmathews basepairprobabilityestimatesimprovethepredictionaccuracyofrnanoncanonicalbasepairs
_version_ 1725142670865596416