Branch Point Selection in RNA Splicing Using Deep Learning

Alternative splicing (AS) is a regulated process that takes place during gene expression by which a single gene may code for multiple proteins. This mechanism is controlled by a complex called spliceosome by which certain exons of a gene may be included in or excluded out from the final mRNA produce...

Full description

Bibliographic Details
Main Authors: Iman Nazari, Hilal Tayara, Kil To Chong
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8574900/
Description
Summary:Alternative splicing (AS) is a regulated process that takes place during gene expression by which a single gene may code for multiple proteins. This mechanism is controlled by a complex called spliceosome by which certain exons of a gene may be included in or excluded out from the final mRNA produced from that gene. In AS, at least three remarkable signals exist in introns and they are 5&#x2019; splice site (5&#x2019;ss), the donor ss where GU nucleotides are more frequently present, 3&#x2019;ss, the acceptor ss where AG nucleotides are more frequently present, and branch site. Generally, branch point site is located at 20 to 50 nucleotides upstream from the 3&#x2019;ss. In this paper, we identify the branch point location using a computational model based on deep learning. We propose a hybrid model based on a combination of dilated convolution neural network and recurrent neural network. Integrating additional inputs to the raw RNA sequence has been studied such as conservation, binding energy, and di-nucleotide. The proposed model has been evaluated on two publicly available datasets and outperformed the current state-of-the-art methods. More specifically, the proposed model achieved for the first dataset 97.29&#x0025; and 67.08&#x0025; of the area under curve (ROC-AUC) and the area under precision recall curve (prAUC), respectively, for the second dataset 96.86&#x0025; and 69.62&#x0025; of ROC-AUC and prAUC, respectively. In addition, pathogenic variants have been studied by the proposed model and agreed with the reported ones biologically. To study RNA branch point selection, an easy-to-use Web server has been established for free access at: <uri>https://home.jbnu.ac.kr/NSCL/rnabps.htm</uri>.
ISSN:2169-3536