GROM-RD: resolving genomic biases to improve read depth detection of copy number variants

Amplifications or deletions of genome segments, known as copy number variants (CNVs), have been associated with many diseases. Read depth analysis of next-generation sequencing (NGS) is an essential method of detecting CNVs. However, genome read coverage is frequently distorted by various biases of...

Full description

Bibliographic Details
Main Authors: Sean D. Smith, Joseph K. Kawash, Andrey Grigoriev
Format: Article
Language:English
Published: PeerJ Inc. 2015-03-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/836.pdf
id doaj-541d4b3b14af445c8289111cf802ddfe
record_format Article
spelling doaj-541d4b3b14af445c8289111cf802ddfe2020-11-24T22:13:37ZengPeerJ Inc.PeerJ2167-83592015-03-013e83610.7717/peerj.836836GROM-RD: resolving genomic biases to improve read depth detection of copy number variantsSean D. Smith0Joseph K. Kawash1Andrey Grigoriev2Department of Biology, Center for Computational and Integrative Biology, Rutgers University, Camden, NJ, USADepartment of Biology, Center for Computational and Integrative Biology, Rutgers University, Camden, NJ, USADepartment of Biology, Center for Computational and Integrative Biology, Rutgers University, Camden, NJ, USAAmplifications or deletions of genome segments, known as copy number variants (CNVs), have been associated with many diseases. Read depth analysis of next-generation sequencing (NGS) is an essential method of detecting CNVs. However, genome read coverage is frequently distorted by various biases of NGS platforms, which reduce predictive capabilities of existing approaches. Additionally, the use of read depth tools has been somewhat hindered by imprecise breakpoint identification. We developed GROM-RD, an algorithm that analyzes multiple biases in read coverage to detect CNVs in NGS data. We found non-uniform variance across distinct GC regions after using existing GC bias correction methods and developed a novel approach to normalize such variance. Although complex and repetitive genome segments complicate CNV detection, GROM-RD adjusts for repeat bias and uses a two-pipeline masking approach to detect CNVs in complex and repetitive segments while improving sensitivity in less complicated regions. To overcome a typical weakness of RD methods, GROM-RD employs a CNV search using size-varying overlapping windows to improve breakpoint resolution. We compared our method to two widely used programs based on read depth methods, CNVnator and RDXplorer, and observed improved CNV detection and breakpoint accuracy for GROM-RD. GROM-RD is available at http://grigoriev.rutgers.edu/software/.https://peerj.com/articles/836.pdfCopy number variantGenomic biasNext gen sequencing
collection DOAJ
language English
format Article
sources DOAJ
author Sean D. Smith
Joseph K. Kawash
Andrey Grigoriev
spellingShingle Sean D. Smith
Joseph K. Kawash
Andrey Grigoriev
GROM-RD: resolving genomic biases to improve read depth detection of copy number variants
PeerJ
Copy number variant
Genomic bias
Next gen sequencing
author_facet Sean D. Smith
Joseph K. Kawash
Andrey Grigoriev
author_sort Sean D. Smith
title GROM-RD: resolving genomic biases to improve read depth detection of copy number variants
title_short GROM-RD: resolving genomic biases to improve read depth detection of copy number variants
title_full GROM-RD: resolving genomic biases to improve read depth detection of copy number variants
title_fullStr GROM-RD: resolving genomic biases to improve read depth detection of copy number variants
title_full_unstemmed GROM-RD: resolving genomic biases to improve read depth detection of copy number variants
title_sort grom-rd: resolving genomic biases to improve read depth detection of copy number variants
publisher PeerJ Inc.
series PeerJ
issn 2167-8359
publishDate 2015-03-01
description Amplifications or deletions of genome segments, known as copy number variants (CNVs), have been associated with many diseases. Read depth analysis of next-generation sequencing (NGS) is an essential method of detecting CNVs. However, genome read coverage is frequently distorted by various biases of NGS platforms, which reduce predictive capabilities of existing approaches. Additionally, the use of read depth tools has been somewhat hindered by imprecise breakpoint identification. We developed GROM-RD, an algorithm that analyzes multiple biases in read coverage to detect CNVs in NGS data. We found non-uniform variance across distinct GC regions after using existing GC bias correction methods and developed a novel approach to normalize such variance. Although complex and repetitive genome segments complicate CNV detection, GROM-RD adjusts for repeat bias and uses a two-pipeline masking approach to detect CNVs in complex and repetitive segments while improving sensitivity in less complicated regions. To overcome a typical weakness of RD methods, GROM-RD employs a CNV search using size-varying overlapping windows to improve breakpoint resolution. We compared our method to two widely used programs based on read depth methods, CNVnator and RDXplorer, and observed improved CNV detection and breakpoint accuracy for GROM-RD. GROM-RD is available at http://grigoriev.rutgers.edu/software/.
topic Copy number variant
Genomic bias
Next gen sequencing
url https://peerj.com/articles/836.pdf
work_keys_str_mv AT seandsmith gromrdresolvinggenomicbiasestoimprovereaddepthdetectionofcopynumbervariants
AT josephkkawash gromrdresolvinggenomicbiasestoimprovereaddepthdetectionofcopynumbervariants
AT andreygrigoriev gromrdresolvinggenomicbiasestoimprovereaddepthdetectionofcopynumbervariants
_version_ 1725800468647510016