Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs.
Prediction of RNA tertiary structure from sequence is an important problem, but generating accurate structure models for even short sequences remains difficult. Predictions of RNA tertiary structure tend to be least accurate in loop regions, where non-canonical pairs are important for determining th...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2017-11-01
|
Series: | PLoS Computational Biology |
Online Access: | http://europepmc.org/articles/PMC5690697?pdf=render |
id |
doaj-604ebd51417a4f2ca5a5f03bbd34025f |
---|---|
record_format |
Article |
spelling |
doaj-604ebd51417a4f2ca5a5f03bbd34025f2020-11-25T01:18:25ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582017-11-011311e100582710.1371/journal.pcbi.1005827Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs.Michael F SlomaDavid H MathewsPrediction of RNA tertiary structure from sequence is an important problem, but generating accurate structure models for even short sequences remains difficult. Predictions of RNA tertiary structure tend to be least accurate in loop regions, where non-canonical pairs are important for determining the details of structure. Non-canonical pairs can be predicted using a knowledge-based model of structure that scores nucleotide cyclic motifs, or NCMs. In this work, a partition function algorithm is introduced that allows the estimation of base pairing probabilities for both canonical and non-canonical interactions. Pairs that are predicted to be probable are more likely to be found in the true structure than pairs of lower probability. Pair probability estimates can be further improved by predicting the structure conserved across multiple homologous sequences using the TurboFold algorithm. These pairing probabilities, used in concert with prior knowledge of the canonical secondary structure, allow accurate inference of non-canonical pairs, an important step towards accurate prediction of the full tertiary structure. Software to predict non-canonical base pairs and pairing probabilities is now provided as part of the RNAstructure software package.http://europepmc.org/articles/PMC5690697?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Michael F Sloma David H Mathews |
spellingShingle |
Michael F Sloma David H Mathews Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs. PLoS Computational Biology |
author_facet |
Michael F Sloma David H Mathews |
author_sort |
Michael F Sloma |
title |
Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs. |
title_short |
Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs. |
title_full |
Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs. |
title_fullStr |
Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs. |
title_full_unstemmed |
Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs. |
title_sort |
base pair probability estimates improve the prediction accuracy of rna non-canonical base pairs. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS Computational Biology |
issn |
1553-734X 1553-7358 |
publishDate |
2017-11-01 |
description |
Prediction of RNA tertiary structure from sequence is an important problem, but generating accurate structure models for even short sequences remains difficult. Predictions of RNA tertiary structure tend to be least accurate in loop regions, where non-canonical pairs are important for determining the details of structure. Non-canonical pairs can be predicted using a knowledge-based model of structure that scores nucleotide cyclic motifs, or NCMs. In this work, a partition function algorithm is introduced that allows the estimation of base pairing probabilities for both canonical and non-canonical interactions. Pairs that are predicted to be probable are more likely to be found in the true structure than pairs of lower probability. Pair probability estimates can be further improved by predicting the structure conserved across multiple homologous sequences using the TurboFold algorithm. These pairing probabilities, used in concert with prior knowledge of the canonical secondary structure, allow accurate inference of non-canonical pairs, an important step towards accurate prediction of the full tertiary structure. Software to predict non-canonical base pairs and pairing probabilities is now provided as part of the RNAstructure software package. |
url |
http://europepmc.org/articles/PMC5690697?pdf=render |
work_keys_str_mv |
AT michaelfsloma basepairprobabilityestimatesimprovethepredictionaccuracyofrnanoncanonicalbasepairs AT davidhmathews basepairprobabilityestimatesimprovethepredictionaccuracyofrnanoncanonicalbasepairs |
_version_ |
1725142670865596416 |