C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning models

Clustered regularly interspaced short palindromic repeat (CRISPR) technology is the most important tool in gene editing, it can be used to target any gene using guide RNA and Cas enzyme, one limitation of CRISPR systems is low guide RNA (gRNA) activity, therefore it is highly important to predict it...

Full description

Bibliographic Details
Main Authors: Zubaida Sa'id Ameen, Mehmet Ozsoz, Auwalu Saleh Mubarak, Fadi Al Turjman, Sertan Serte
Format: Article
Language:English
Published: Elsevier 2021-08-01
Series:Alexandria Engineering Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1110016821000788
id doaj-b5207f11f5704de69b5082cae41529ec
record_format Article
spelling doaj-b5207f11f5704de69b5082cae41529ec2021-06-02T18:38:47ZengElsevierAlexandria Engineering Journal1110-01682021-08-0160435013508C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning modelsZubaida Sa'id Ameen0Mehmet Ozsoz1Auwalu Saleh Mubarak2Fadi Al Turjman3Sertan Serte4Biomedical Engineering Department, Near East University, Nicosia, Cyprus; Corresponding author.Biomedical Engineering Department, Near East University, Nicosia, CyprusElectrical and Electronic Engineering Department, Near East University, Nicosia, CyprusArtificial Intelligence Engineering Department, Near East University, Nicosia, CyprusElectrical and Electronic Engineering Department, Near East University, Nicosia, CyprusClustered regularly interspaced short palindromic repeat (CRISPR) technology is the most important tool in gene editing, it can be used to target any gene using guide RNA and Cas enzyme, one limitation of CRISPR systems is low guide RNA (gRNA) activity, therefore it is highly important to predict its gRNA activity. The activity of gRNA can be determined by measuring the score for the frequency of insertion or deletion (indel). In this work, CNN was optimized by changing the convolution layer depth and filter kernel size to determine how well the model will perform, also, we compared traditional Multiple Linear Regression (MLR), Convolutional Neural Network (CNN) and combine CNN with Support Vector Regressor (SVR) to form a hybrid model CNN-SVR for the prediction of gRNA activity. Based on the Spearman Correlation (SC) the hybrid model turns out to outperform state of the art model by an increase of up to 40% in predicting gRNA activity. Finally, we predicted the indel frequency for gRNA sequences used for detection of COVID-19 to validate the hybrid model, this will assist in choosing the best gRNA for detection COVID-19 virus using CRISPR/Cas12 system.http://www.sciencedirect.com/science/article/pii/S1110016821000788CRISPRGuide RNA activityCOVID-19Indel frequencyDeep learningConvolutional Neural Network
collection DOAJ
language English
format Article
sources DOAJ
author Zubaida Sa'id Ameen
Mehmet Ozsoz
Auwalu Saleh Mubarak
Fadi Al Turjman
Sertan Serte
spellingShingle Zubaida Sa'id Ameen
Mehmet Ozsoz
Auwalu Saleh Mubarak
Fadi Al Turjman
Sertan Serte
C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning models
Alexandria Engineering Journal
CRISPR
Guide RNA activity
COVID-19
Indel frequency
Deep learning
Convolutional Neural Network
author_facet Zubaida Sa'id Ameen
Mehmet Ozsoz
Auwalu Saleh Mubarak
Fadi Al Turjman
Sertan Serte
author_sort Zubaida Sa'id Ameen
title C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning models
title_short C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning models
title_full C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning models
title_fullStr C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning models
title_full_unstemmed C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning models
title_sort c-svr crispr: prediction of crispr/cas12 guiderna activity using deep learning models
publisher Elsevier
series Alexandria Engineering Journal
issn 1110-0168
publishDate 2021-08-01
description Clustered regularly interspaced short palindromic repeat (CRISPR) technology is the most important tool in gene editing, it can be used to target any gene using guide RNA and Cas enzyme, one limitation of CRISPR systems is low guide RNA (gRNA) activity, therefore it is highly important to predict its gRNA activity. The activity of gRNA can be determined by measuring the score for the frequency of insertion or deletion (indel). In this work, CNN was optimized by changing the convolution layer depth and filter kernel size to determine how well the model will perform, also, we compared traditional Multiple Linear Regression (MLR), Convolutional Neural Network (CNN) and combine CNN with Support Vector Regressor (SVR) to form a hybrid model CNN-SVR for the prediction of gRNA activity. Based on the Spearman Correlation (SC) the hybrid model turns out to outperform state of the art model by an increase of up to 40% in predicting gRNA activity. Finally, we predicted the indel frequency for gRNA sequences used for detection of COVID-19 to validate the hybrid model, this will assist in choosing the best gRNA for detection COVID-19 virus using CRISPR/Cas12 system.
topic CRISPR
Guide RNA activity
COVID-19
Indel frequency
Deep learning
Convolutional Neural Network
url http://www.sciencedirect.com/science/article/pii/S1110016821000788
work_keys_str_mv AT zubaidasaidameen csvrcrisprpredictionofcrisprcas12guidernaactivityusingdeeplearningmodels
AT mehmetozsoz csvrcrisprpredictionofcrisprcas12guidernaactivityusingdeeplearningmodels
AT auwalusalehmubarak csvrcrisprpredictionofcrisprcas12guidernaactivityusingdeeplearningmodels
AT fadialturjman csvrcrisprpredictionofcrisprcas12guidernaactivityusingdeeplearningmodels
AT sertanserte csvrcrisprpredictionofcrisprcas12guidernaactivityusingdeeplearningmodels
_version_ 1721402215967490048