Performance of copy number variants detection based on whole-genome sequencing by DNBSEQ platforms

Abstract Background DNBSEQ™ platforms are new massively parallel sequencing (MPS) platforms that use DNA nanoball technology. Use of data generated from DNBSEQ™ platforms to detect single nucleotide variants (SNVs) and small insertions and deletions (indels) has proven to be quite effective, while t...

Full description

Bibliographic Details
Main Authors: Junhua Rao, Lihua Peng, Xinming Liang, Hui Jiang, Chunyu Geng, Xia Zhao, Xin Liu, Guangyi Fan, Fang Chen, Feng Mu
Format: Article
Language:English
Published: BMC 2020-11-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-020-03859-x
id doaj-3270323e8075475daf1dd41c6834cb99
record_format Article
spelling doaj-3270323e8075475daf1dd41c6834cb992020-11-25T04:08:42ZengBMCBMC Bioinformatics1471-21052020-11-0121111410.1186/s12859-020-03859-xPerformance of copy number variants detection based on whole-genome sequencing by DNBSEQ platformsJunhua Rao0Lihua Peng1Xinming Liang2Hui Jiang3Chunyu Geng4Xia Zhao5Xin Liu6Guangyi Fan7Fang Chen8Feng Mu9MGI, BGI-ShenzhenBGI-ShenzhenMGI, BGI-ShenzhenMGI, BGI-ShenzhenMGI, BGI-ShenzhenMGI, BGI-ShenzhenBGI-ShenzhenBGI-Qingdao, BGI-ShenzhenMGI, BGI-ShenzhenMGI, BGI-ShenzhenAbstract Background DNBSEQ™ platforms are new massively parallel sequencing (MPS) platforms that use DNA nanoball technology. Use of data generated from DNBSEQ™ platforms to detect single nucleotide variants (SNVs) and small insertions and deletions (indels) has proven to be quite effective, while the feasibility of copy number variants (CNVs) detection is unclear. Results Here, we first benchmarked different CNV detection tools based on Illumina whole-genome sequencing (WGS) data of NA12878 and then assessed these tools in CNV detection based on DNBSEQ™ sequencing data from the same sample. When the same tool was used, the CNVs detected based on DNBSEQ™ and Illumina data were similar in quantity, length and distribution, while great differences existed within results from different tools and even based on data from a single platform. We further estimated the CNV detection power based on available CNV benchmarks of NA12878 and found similar precision and sensitivity between the DNBSEQ™ and Illumina platforms. We also found higher precision of CNVs shorter than 1 kbp based on DNBSEQ™ platforms than those based on Illumina platforms by using Pindel, DELLY and LUMPY. We carefully compared these two available benchmarks and found a large proportion of specific CNVs between them. Thus, we constructed a more complete CNV benchmark of NA12878 containing 3512 CNV regions. Conclusions We assessed and benchmarked CNV detections based on WGS with DNBSEQ™ platforms and provide guidelines for future studies.http://link.springer.com/article/10.1186/s12859-020-03859-xCopy number variant (CNV)Whole-genome sequencing (WGS)DNBSEQBenchmark
collection DOAJ
language English
format Article
sources DOAJ
author Junhua Rao
Lihua Peng
Xinming Liang
Hui Jiang
Chunyu Geng
Xia Zhao
Xin Liu
Guangyi Fan
Fang Chen
Feng Mu
spellingShingle Junhua Rao
Lihua Peng
Xinming Liang
Hui Jiang
Chunyu Geng
Xia Zhao
Xin Liu
Guangyi Fan
Fang Chen
Feng Mu
Performance of copy number variants detection based on whole-genome sequencing by DNBSEQ platforms
BMC Bioinformatics
Copy number variant (CNV)
Whole-genome sequencing (WGS)
DNBSEQ
Benchmark
author_facet Junhua Rao
Lihua Peng
Xinming Liang
Hui Jiang
Chunyu Geng
Xia Zhao
Xin Liu
Guangyi Fan
Fang Chen
Feng Mu
author_sort Junhua Rao
title Performance of copy number variants detection based on whole-genome sequencing by DNBSEQ platforms
title_short Performance of copy number variants detection based on whole-genome sequencing by DNBSEQ platforms
title_full Performance of copy number variants detection based on whole-genome sequencing by DNBSEQ platforms
title_fullStr Performance of copy number variants detection based on whole-genome sequencing by DNBSEQ platforms
title_full_unstemmed Performance of copy number variants detection based on whole-genome sequencing by DNBSEQ platforms
title_sort performance of copy number variants detection based on whole-genome sequencing by dnbseq platforms
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2020-11-01
description Abstract Background DNBSEQ™ platforms are new massively parallel sequencing (MPS) platforms that use DNA nanoball technology. Use of data generated from DNBSEQ™ platforms to detect single nucleotide variants (SNVs) and small insertions and deletions (indels) has proven to be quite effective, while the feasibility of copy number variants (CNVs) detection is unclear. Results Here, we first benchmarked different CNV detection tools based on Illumina whole-genome sequencing (WGS) data of NA12878 and then assessed these tools in CNV detection based on DNBSEQ™ sequencing data from the same sample. When the same tool was used, the CNVs detected based on DNBSEQ™ and Illumina data were similar in quantity, length and distribution, while great differences existed within results from different tools and even based on data from a single platform. We further estimated the CNV detection power based on available CNV benchmarks of NA12878 and found similar precision and sensitivity between the DNBSEQ™ and Illumina platforms. We also found higher precision of CNVs shorter than 1 kbp based on DNBSEQ™ platforms than those based on Illumina platforms by using Pindel, DELLY and LUMPY. We carefully compared these two available benchmarks and found a large proportion of specific CNVs between them. Thus, we constructed a more complete CNV benchmark of NA12878 containing 3512 CNV regions. Conclusions We assessed and benchmarked CNV detections based on WGS with DNBSEQ™ platforms and provide guidelines for future studies.
topic Copy number variant (CNV)
Whole-genome sequencing (WGS)
DNBSEQ
Benchmark
url http://link.springer.com/article/10.1186/s12859-020-03859-x
work_keys_str_mv AT junhuarao performanceofcopynumbervariantsdetectionbasedonwholegenomesequencingbydnbseqplatforms
AT lihuapeng performanceofcopynumbervariantsdetectionbasedonwholegenomesequencingbydnbseqplatforms
AT xinmingliang performanceofcopynumbervariantsdetectionbasedonwholegenomesequencingbydnbseqplatforms
AT huijiang performanceofcopynumbervariantsdetectionbasedonwholegenomesequencingbydnbseqplatforms
AT chunyugeng performanceofcopynumbervariantsdetectionbasedonwholegenomesequencingbydnbseqplatforms
AT xiazhao performanceofcopynumbervariantsdetectionbasedonwholegenomesequencingbydnbseqplatforms
AT xinliu performanceofcopynumbervariantsdetectionbasedonwholegenomesequencingbydnbseqplatforms
AT guangyifan performanceofcopynumbervariantsdetectionbasedonwholegenomesequencingbydnbseqplatforms
AT fangchen performanceofcopynumbervariantsdetectionbasedonwholegenomesequencingbydnbseqplatforms
AT fengmu performanceofcopynumbervariantsdetectionbasedonwholegenomesequencingbydnbseqplatforms
_version_ 1724424341413691392