C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning models
Clustered regularly interspaced short palindromic repeat (CRISPR) technology is the most important tool in gene editing, it can be used to target any gene using guide RNA and Cas enzyme, one limitation of CRISPR systems is low guide RNA (gRNA) activity, therefore it is highly important to predict it...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2021-08-01
|
Series: | Alexandria Engineering Journal |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S1110016821000788 |
id |
doaj-b5207f11f5704de69b5082cae41529ec |
---|---|
record_format |
Article |
spelling |
doaj-b5207f11f5704de69b5082cae41529ec2021-06-02T18:38:47ZengElsevierAlexandria Engineering Journal1110-01682021-08-0160435013508C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning modelsZubaida Sa'id Ameen0Mehmet Ozsoz1Auwalu Saleh Mubarak2Fadi Al Turjman3Sertan Serte4Biomedical Engineering Department, Near East University, Nicosia, Cyprus; Corresponding author.Biomedical Engineering Department, Near East University, Nicosia, CyprusElectrical and Electronic Engineering Department, Near East University, Nicosia, CyprusArtificial Intelligence Engineering Department, Near East University, Nicosia, CyprusElectrical and Electronic Engineering Department, Near East University, Nicosia, CyprusClustered regularly interspaced short palindromic repeat (CRISPR) technology is the most important tool in gene editing, it can be used to target any gene using guide RNA and Cas enzyme, one limitation of CRISPR systems is low guide RNA (gRNA) activity, therefore it is highly important to predict its gRNA activity. The activity of gRNA can be determined by measuring the score for the frequency of insertion or deletion (indel). In this work, CNN was optimized by changing the convolution layer depth and filter kernel size to determine how well the model will perform, also, we compared traditional Multiple Linear Regression (MLR), Convolutional Neural Network (CNN) and combine CNN with Support Vector Regressor (SVR) to form a hybrid model CNN-SVR for the prediction of gRNA activity. Based on the Spearman Correlation (SC) the hybrid model turns out to outperform state of the art model by an increase of up to 40% in predicting gRNA activity. Finally, we predicted the indel frequency for gRNA sequences used for detection of COVID-19 to validate the hybrid model, this will assist in choosing the best gRNA for detection COVID-19 virus using CRISPR/Cas12 system.http://www.sciencedirect.com/science/article/pii/S1110016821000788CRISPRGuide RNA activityCOVID-19Indel frequencyDeep learningConvolutional Neural Network |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zubaida Sa'id Ameen Mehmet Ozsoz Auwalu Saleh Mubarak Fadi Al Turjman Sertan Serte |
spellingShingle |
Zubaida Sa'id Ameen Mehmet Ozsoz Auwalu Saleh Mubarak Fadi Al Turjman Sertan Serte C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning models Alexandria Engineering Journal CRISPR Guide RNA activity COVID-19 Indel frequency Deep learning Convolutional Neural Network |
author_facet |
Zubaida Sa'id Ameen Mehmet Ozsoz Auwalu Saleh Mubarak Fadi Al Turjman Sertan Serte |
author_sort |
Zubaida Sa'id Ameen |
title |
C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning models |
title_short |
C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning models |
title_full |
C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning models |
title_fullStr |
C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning models |
title_full_unstemmed |
C-SVR Crispr: Prediction of CRISPR/Cas12 guideRNA activity using deep learning models |
title_sort |
c-svr crispr: prediction of crispr/cas12 guiderna activity using deep learning models |
publisher |
Elsevier |
series |
Alexandria Engineering Journal |
issn |
1110-0168 |
publishDate |
2021-08-01 |
description |
Clustered regularly interspaced short palindromic repeat (CRISPR) technology is the most important tool in gene editing, it can be used to target any gene using guide RNA and Cas enzyme, one limitation of CRISPR systems is low guide RNA (gRNA) activity, therefore it is highly important to predict its gRNA activity. The activity of gRNA can be determined by measuring the score for the frequency of insertion or deletion (indel). In this work, CNN was optimized by changing the convolution layer depth and filter kernel size to determine how well the model will perform, also, we compared traditional Multiple Linear Regression (MLR), Convolutional Neural Network (CNN) and combine CNN with Support Vector Regressor (SVR) to form a hybrid model CNN-SVR for the prediction of gRNA activity. Based on the Spearman Correlation (SC) the hybrid model turns out to outperform state of the art model by an increase of up to 40% in predicting gRNA activity. Finally, we predicted the indel frequency for gRNA sequences used for detection of COVID-19 to validate the hybrid model, this will assist in choosing the best gRNA for detection COVID-19 virus using CRISPR/Cas12 system. |
topic |
CRISPR Guide RNA activity COVID-19 Indel frequency Deep learning Convolutional Neural Network |
url |
http://www.sciencedirect.com/science/article/pii/S1110016821000788 |
work_keys_str_mv |
AT zubaidasaidameen csvrcrisprpredictionofcrisprcas12guidernaactivityusingdeeplearningmodels AT mehmetozsoz csvrcrisprpredictionofcrisprcas12guidernaactivityusingdeeplearningmodels AT auwalusalehmubarak csvrcrisprpredictionofcrisprcas12guidernaactivityusingdeeplearningmodels AT fadialturjman csvrcrisprpredictionofcrisprcas12guidernaactivityusingdeeplearningmodels AT sertanserte csvrcrisprpredictionofcrisprcas12guidernaactivityusingdeeplearningmodels |
_version_ |
1721402215967490048 |