Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm

Abstract Background The beauty and power of the genome editing mechanism, CRISPR Cas9 endonuclease system, lies in the fact that it is RNA-programmable such that Cas9 can be guided to any genomic loci complementary to a 20-nt RNA, single guide RNA (sgRNA), to cleave double stranded DNA, allowing the...

Full description

Bibliographic Details
Main Authors: Hong Zhou, Michael Zhou, Daisy Li, Joseph Manthey, Ekaterina Lioutikova, Hong Wang, Xiao Zeng
Format: Article
Language:English
Published: BMC 2017-11-01
Series:BMC Genomics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12864-017-4225-1
id doaj-337bfacf641b4a77bd312071194b09bd
record_format Article
spelling doaj-337bfacf641b4a77bd312071194b09bd2020-11-24T22:59:41ZengBMCBMC Genomics1471-21642017-11-0118S9313810.1186/s12864-017-4225-1Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithmHong Zhou0Michael Zhou1Daisy Li2Joseph Manthey3Ekaterina Lioutikova4Hong Wang5Xiao Zeng6Department of Mathematical Science, University of Saint JosephHall High SchoolHall High SchoolDepartment of Mathematical Science, University of Saint JosephDepartment of Mathematical Science, University of Saint JosephSusan L. Cullman Laboratory for Cancer Research, Department of Chemical Biology and Centre for Cancer Prevention Research, Ernest Mario School of Pharmacy, Rutgers, The State University of New JerseyPBSG, LLCAbstract Background The beauty and power of the genome editing mechanism, CRISPR Cas9 endonuclease system, lies in the fact that it is RNA-programmable such that Cas9 can be guided to any genomic loci complementary to a 20-nt RNA, single guide RNA (sgRNA), to cleave double stranded DNA, allowing the introduction of wanted mutations. Unfortunately, it has been reported repeatedly that the sgRNA can also guide Cas9 to off-target sites where the DNA sequence is homologous to sgRNA. Results Using human genome and Streptococcus pyogenes Cas9 (SpCas9) as an example, this article mathematically analyzed the probabilities of off-target homologies of sgRNAs and discovered that for large genome size such as human genome, potential off-target homologies are inevitable for sgRNA selection. A highly efficient computationl algorithm was developed for whole genome sgRNA design and off-target homology searches. By means of a dynamically constructed sequence-indexed database and a simplified sequence alignment method, this algorithm achieves very high efficiency while guaranteeing the identification of all existing potential off-target homologies. Via this algorithm, 1,876,775 sgRNAs were designed for the 19,153 human mRNA genes and only two sgRNAs were found to be free of off-target homology. Conclusions By means of the novel and efficient sgRNA homology search algorithm introduced in this article, genome wide sgRNA design and off-target analysis were conducted and the results confirmed the mathematical analysis that for a sgRNA sequence, it is almost impossible to escape potential off-target homologies. Future innovations on the CRISPR Cas9 gene editing technology need to focus on how to eliminate the Cas9 off-target activity.http://link.springer.com/article/10.1186/s12864-017-4225-1sgRNAOff-target homologyCrisprCas9Computational algorithmGenome wide
collection DOAJ
language English
format Article
sources DOAJ
author Hong Zhou
Michael Zhou
Daisy Li
Joseph Manthey
Ekaterina Lioutikova
Hong Wang
Xiao Zeng
spellingShingle Hong Zhou
Michael Zhou
Daisy Li
Joseph Manthey
Ekaterina Lioutikova
Hong Wang
Xiao Zeng
Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm
BMC Genomics
sgRNA
Off-target homology
Crispr
Cas9
Computational algorithm
Genome wide
author_facet Hong Zhou
Michael Zhou
Daisy Li
Joseph Manthey
Ekaterina Lioutikova
Hong Wang
Xiao Zeng
author_sort Hong Zhou
title Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm
title_short Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm
title_full Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm
title_fullStr Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm
title_full_unstemmed Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm
title_sort whole genome analysis of crispr cas9 sgrna off-target homologies via an efficient computational algorithm
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2017-11-01
description Abstract Background The beauty and power of the genome editing mechanism, CRISPR Cas9 endonuclease system, lies in the fact that it is RNA-programmable such that Cas9 can be guided to any genomic loci complementary to a 20-nt RNA, single guide RNA (sgRNA), to cleave double stranded DNA, allowing the introduction of wanted mutations. Unfortunately, it has been reported repeatedly that the sgRNA can also guide Cas9 to off-target sites where the DNA sequence is homologous to sgRNA. Results Using human genome and Streptococcus pyogenes Cas9 (SpCas9) as an example, this article mathematically analyzed the probabilities of off-target homologies of sgRNAs and discovered that for large genome size such as human genome, potential off-target homologies are inevitable for sgRNA selection. A highly efficient computationl algorithm was developed for whole genome sgRNA design and off-target homology searches. By means of a dynamically constructed sequence-indexed database and a simplified sequence alignment method, this algorithm achieves very high efficiency while guaranteeing the identification of all existing potential off-target homologies. Via this algorithm, 1,876,775 sgRNAs were designed for the 19,153 human mRNA genes and only two sgRNAs were found to be free of off-target homology. Conclusions By means of the novel and efficient sgRNA homology search algorithm introduced in this article, genome wide sgRNA design and off-target analysis were conducted and the results confirmed the mathematical analysis that for a sgRNA sequence, it is almost impossible to escape potential off-target homologies. Future innovations on the CRISPR Cas9 gene editing technology need to focus on how to eliminate the Cas9 off-target activity.
topic sgRNA
Off-target homology
Crispr
Cas9
Computational algorithm
Genome wide
url http://link.springer.com/article/10.1186/s12864-017-4225-1
work_keys_str_mv AT hongzhou wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm
AT michaelzhou wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm
AT daisyli wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm
AT josephmanthey wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm
AT ekaterinalioutikova wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm
AT hongwang wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm
AT xiaozeng wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm
_version_ 1725644185788219392