On an enhancement of RNA probing data using information theory

Abstract Identifying the secondary structure of an RNA is crucial for understanding its diverse regulatory functions. This paper focuses on how to enhance target identification in a Boltzmann ensemble of structures via chemical probing data. We employ an information-theoretic approach to solve the p...

Full description

Bibliographic Details
Main Authors: Thomas J. X. Li, Christian M. Reidys
Format: Article
Language:English
Published: BMC 2020-08-01
Series:Algorithms for Molecular Biology
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13015-020-00176-z
id doaj-203fd45f52a44d95843883407b66e528
record_format Article
spelling doaj-203fd45f52a44d95843883407b66e5282020-11-25T02:59:16ZengBMCAlgorithms for Molecular Biology1748-71882020-08-0115112210.1186/s13015-020-00176-zOn an enhancement of RNA probing data using information theoryThomas J. X. Li0Christian M. Reidys1Biocomplexity Institute & Initiative, University of VirginiaBiocomplexity Institute & Initiative, University of VirginiaAbstract Identifying the secondary structure of an RNA is crucial for understanding its diverse regulatory functions. This paper focuses on how to enhance target identification in a Boltzmann ensemble of structures via chemical probing data. We employ an information-theoretic approach to solve the problem, via considering a variant of the Rényi-Ulam game. Our framework is centered around the ensemble tree, a hierarchical bi-partition of the input ensemble, that is constructed by recursively querying about whether or not a base pair of maximum information entropy is contained in the target. These queries are answered via relating local with global probing data, employing the modularity in RNA secondary structures. We present that leaves of the tree are comprised of sub-samples exhibiting a distinguished structure with high probability. In particular, for a Boltzmann ensemble incorporating probing data, which is well established in the literature, the probability of our framework correctly identifying the target in the leaf is greater than $$90\%$$ 90 % .http://link.springer.com/article/10.1186/s13015-020-00176-zRNA structureChemical probingRényi-Ulam gameInformation theory
collection DOAJ
language English
format Article
sources DOAJ
author Thomas J. X. Li
Christian M. Reidys
spellingShingle Thomas J. X. Li
Christian M. Reidys
On an enhancement of RNA probing data using information theory
Algorithms for Molecular Biology
RNA structure
Chemical probing
Rényi-Ulam game
Information theory
author_facet Thomas J. X. Li
Christian M. Reidys
author_sort Thomas J. X. Li
title On an enhancement of RNA probing data using information theory
title_short On an enhancement of RNA probing data using information theory
title_full On an enhancement of RNA probing data using information theory
title_fullStr On an enhancement of RNA probing data using information theory
title_full_unstemmed On an enhancement of RNA probing data using information theory
title_sort on an enhancement of rna probing data using information theory
publisher BMC
series Algorithms for Molecular Biology
issn 1748-7188
publishDate 2020-08-01
description Abstract Identifying the secondary structure of an RNA is crucial for understanding its diverse regulatory functions. This paper focuses on how to enhance target identification in a Boltzmann ensemble of structures via chemical probing data. We employ an information-theoretic approach to solve the problem, via considering a variant of the Rényi-Ulam game. Our framework is centered around the ensemble tree, a hierarchical bi-partition of the input ensemble, that is constructed by recursively querying about whether or not a base pair of maximum information entropy is contained in the target. These queries are answered via relating local with global probing data, employing the modularity in RNA secondary structures. We present that leaves of the tree are comprised of sub-samples exhibiting a distinguished structure with high probability. In particular, for a Boltzmann ensemble incorporating probing data, which is well established in the literature, the probability of our framework correctly identifying the target in the leaf is greater than $$90\%$$ 90 % .
topic RNA structure
Chemical probing
Rényi-Ulam game
Information theory
url http://link.springer.com/article/10.1186/s13015-020-00176-z
work_keys_str_mv AT thomasjxli onanenhancementofrnaprobingdatausinginformationtheory
AT christianmreidys onanenhancementofrnaprobingdatausinginformationtheory
_version_ 1724703416334155776