Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm
Abstract Background The beauty and power of the genome editing mechanism, CRISPR Cas9 endonuclease system, lies in the fact that it is RNA-programmable such that Cas9 can be guided to any genomic loci complementary to a 20-nt RNA, single guide RNA (sgRNA), to cleave double stranded DNA, allowing the...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2017-11-01
|
Series: | BMC Genomics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12864-017-4225-1 |
id |
doaj-337bfacf641b4a77bd312071194b09bd |
---|---|
record_format |
Article |
spelling |
doaj-337bfacf641b4a77bd312071194b09bd2020-11-24T22:59:41ZengBMCBMC Genomics1471-21642017-11-0118S9313810.1186/s12864-017-4225-1Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithmHong Zhou0Michael Zhou1Daisy Li2Joseph Manthey3Ekaterina Lioutikova4Hong Wang5Xiao Zeng6Department of Mathematical Science, University of Saint JosephHall High SchoolHall High SchoolDepartment of Mathematical Science, University of Saint JosephDepartment of Mathematical Science, University of Saint JosephSusan L. Cullman Laboratory for Cancer Research, Department of Chemical Biology and Centre for Cancer Prevention Research, Ernest Mario School of Pharmacy, Rutgers, The State University of New JerseyPBSG, LLCAbstract Background The beauty and power of the genome editing mechanism, CRISPR Cas9 endonuclease system, lies in the fact that it is RNA-programmable such that Cas9 can be guided to any genomic loci complementary to a 20-nt RNA, single guide RNA (sgRNA), to cleave double stranded DNA, allowing the introduction of wanted mutations. Unfortunately, it has been reported repeatedly that the sgRNA can also guide Cas9 to off-target sites where the DNA sequence is homologous to sgRNA. Results Using human genome and Streptococcus pyogenes Cas9 (SpCas9) as an example, this article mathematically analyzed the probabilities of off-target homologies of sgRNAs and discovered that for large genome size such as human genome, potential off-target homologies are inevitable for sgRNA selection. A highly efficient computationl algorithm was developed for whole genome sgRNA design and off-target homology searches. By means of a dynamically constructed sequence-indexed database and a simplified sequence alignment method, this algorithm achieves very high efficiency while guaranteeing the identification of all existing potential off-target homologies. Via this algorithm, 1,876,775 sgRNAs were designed for the 19,153 human mRNA genes and only two sgRNAs were found to be free of off-target homology. Conclusions By means of the novel and efficient sgRNA homology search algorithm introduced in this article, genome wide sgRNA design and off-target analysis were conducted and the results confirmed the mathematical analysis that for a sgRNA sequence, it is almost impossible to escape potential off-target homologies. Future innovations on the CRISPR Cas9 gene editing technology need to focus on how to eliminate the Cas9 off-target activity.http://link.springer.com/article/10.1186/s12864-017-4225-1sgRNAOff-target homologyCrisprCas9Computational algorithmGenome wide |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Hong Zhou Michael Zhou Daisy Li Joseph Manthey Ekaterina Lioutikova Hong Wang Xiao Zeng |
spellingShingle |
Hong Zhou Michael Zhou Daisy Li Joseph Manthey Ekaterina Lioutikova Hong Wang Xiao Zeng Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm BMC Genomics sgRNA Off-target homology Crispr Cas9 Computational algorithm Genome wide |
author_facet |
Hong Zhou Michael Zhou Daisy Li Joseph Manthey Ekaterina Lioutikova Hong Wang Xiao Zeng |
author_sort |
Hong Zhou |
title |
Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm |
title_short |
Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm |
title_full |
Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm |
title_fullStr |
Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm |
title_full_unstemmed |
Whole genome analysis of CRISPR Cas9 sgRNA off-target homologies via an efficient computational algorithm |
title_sort |
whole genome analysis of crispr cas9 sgrna off-target homologies via an efficient computational algorithm |
publisher |
BMC |
series |
BMC Genomics |
issn |
1471-2164 |
publishDate |
2017-11-01 |
description |
Abstract Background The beauty and power of the genome editing mechanism, CRISPR Cas9 endonuclease system, lies in the fact that it is RNA-programmable such that Cas9 can be guided to any genomic loci complementary to a 20-nt RNA, single guide RNA (sgRNA), to cleave double stranded DNA, allowing the introduction of wanted mutations. Unfortunately, it has been reported repeatedly that the sgRNA can also guide Cas9 to off-target sites where the DNA sequence is homologous to sgRNA. Results Using human genome and Streptococcus pyogenes Cas9 (SpCas9) as an example, this article mathematically analyzed the probabilities of off-target homologies of sgRNAs and discovered that for large genome size such as human genome, potential off-target homologies are inevitable for sgRNA selection. A highly efficient computationl algorithm was developed for whole genome sgRNA design and off-target homology searches. By means of a dynamically constructed sequence-indexed database and a simplified sequence alignment method, this algorithm achieves very high efficiency while guaranteeing the identification of all existing potential off-target homologies. Via this algorithm, 1,876,775 sgRNAs were designed for the 19,153 human mRNA genes and only two sgRNAs were found to be free of off-target homology. Conclusions By means of the novel and efficient sgRNA homology search algorithm introduced in this article, genome wide sgRNA design and off-target analysis were conducted and the results confirmed the mathematical analysis that for a sgRNA sequence, it is almost impossible to escape potential off-target homologies. Future innovations on the CRISPR Cas9 gene editing technology need to focus on how to eliminate the Cas9 off-target activity. |
topic |
sgRNA Off-target homology Crispr Cas9 Computational algorithm Genome wide |
url |
http://link.springer.com/article/10.1186/s12864-017-4225-1 |
work_keys_str_mv |
AT hongzhou wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm AT michaelzhou wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm AT daisyli wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm AT josephmanthey wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm AT ekaterinalioutikova wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm AT hongwang wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm AT xiaozeng wholegenomeanalysisofcrisprcas9sgrnaofftargethomologiesviaanefficientcomputationalalgorithm |
_version_ |
1725644185788219392 |