CoNVEX: copy number variation estimation in exome sequencing data using HMM

<p>Abstract</p> <p>Background</p> <p>One of the main types of genetic variations in cancer is Copy Number Variations (CNV). Whole exome sequenicng (WES) is a popular alternative to whole genome sequencing (WGS) to study disease specific genomic variations. However, find...

Full description

Bibliographic Details
Main Authors: Amarasinghe Kaushalya C, Li Jason, Halgamuge Saman K
Format: Article
Language:English
Published: BMC 2013-01-01
Series:BMC Bioinformatics
Subjects:
id doaj-f773624b614648fe81cddd3e15be458a
record_format Article
spelling doaj-f773624b614648fe81cddd3e15be458a2020-11-25T01:40:12ZengBMCBMC Bioinformatics1471-21052013-01-0114Suppl 2S210.1186/1471-2105-14-S2-S2CoNVEX: copy number variation estimation in exome sequencing data using HMMAmarasinghe Kaushalya CLi JasonHalgamuge Saman K<p>Abstract</p> <p>Background</p> <p>One of the main types of genetic variations in cancer is Copy Number Variations (CNV). Whole exome sequenicng (WES) is a popular alternative to whole genome sequencing (WGS) to study disease specific genomic variations. However, finding CNV in Cancer samples using WES data has not been fully explored.</p> <p>Results</p> <p>We present a new method, called CoNVEX, to estimate copy number variation in whole exome sequencing data. It uses ratio of tumour and matched normal average read depths at each exonic region, to predict the copy gain or loss. The useful signal produced by WES data will be hindered by the intrinsic noise present in the data itself. This limits its capacity to be used as a highly reliable CNV detection source. Here, we propose a method that consists of discrete wavelet transform (DWT) to reduce noise. The identification of copy number gains/losses of each targeted region is performed by a Hidden Markov Model (HMM).</p> <p>Conclusion</p> <p>HMM is frequently used to identify CNV in data produced by various technologies including Array Comparative Genomic Hybridization (aCGH) and WGS. Here, we propose an HMM to detect CNV in cancer exome data. We used modified data from 1000 Genomes project to evaluate the performance of the proposed method. Using these data we have shown that CoNVEX outperforms the existing methods significantly in terms of precision. Overall, CoNVEX achieved a sensitivity of more than 92% and a precision of more than 50%.</p> CNV detectionCancer GenomeTargeted resequencingWhole exome sequencingHidden Markov ModelsDiscrete Wavelet Transform
collection DOAJ
language English
format Article
sources DOAJ
author Amarasinghe Kaushalya C
Li Jason
Halgamuge Saman K
spellingShingle Amarasinghe Kaushalya C
Li Jason
Halgamuge Saman K
CoNVEX: copy number variation estimation in exome sequencing data using HMM
BMC Bioinformatics
CNV detection
Cancer Genome
Targeted resequencing
Whole exome sequencing
Hidden Markov Models
Discrete Wavelet Transform
author_facet Amarasinghe Kaushalya C
Li Jason
Halgamuge Saman K
author_sort Amarasinghe Kaushalya C
title CoNVEX: copy number variation estimation in exome sequencing data using HMM
title_short CoNVEX: copy number variation estimation in exome sequencing data using HMM
title_full CoNVEX: copy number variation estimation in exome sequencing data using HMM
title_fullStr CoNVEX: copy number variation estimation in exome sequencing data using HMM
title_full_unstemmed CoNVEX: copy number variation estimation in exome sequencing data using HMM
title_sort convex: copy number variation estimation in exome sequencing data using hmm
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2013-01-01
description <p>Abstract</p> <p>Background</p> <p>One of the main types of genetic variations in cancer is Copy Number Variations (CNV). Whole exome sequenicng (WES) is a popular alternative to whole genome sequencing (WGS) to study disease specific genomic variations. However, finding CNV in Cancer samples using WES data has not been fully explored.</p> <p>Results</p> <p>We present a new method, called CoNVEX, to estimate copy number variation in whole exome sequencing data. It uses ratio of tumour and matched normal average read depths at each exonic region, to predict the copy gain or loss. The useful signal produced by WES data will be hindered by the intrinsic noise present in the data itself. This limits its capacity to be used as a highly reliable CNV detection source. Here, we propose a method that consists of discrete wavelet transform (DWT) to reduce noise. The identification of copy number gains/losses of each targeted region is performed by a Hidden Markov Model (HMM).</p> <p>Conclusion</p> <p>HMM is frequently used to identify CNV in data produced by various technologies including Array Comparative Genomic Hybridization (aCGH) and WGS. Here, we propose an HMM to detect CNV in cancer exome data. We used modified data from 1000 Genomes project to evaluate the performance of the proposed method. Using these data we have shown that CoNVEX outperforms the existing methods significantly in terms of precision. Overall, CoNVEX achieved a sensitivity of more than 92% and a precision of more than 50%.</p>
topic CNV detection
Cancer Genome
Targeted resequencing
Whole exome sequencing
Hidden Markov Models
Discrete Wavelet Transform
work_keys_str_mv AT amarasinghekaushalyac convexcopynumbervariationestimationinexomesequencingdatausinghmm
AT lijason convexcopynumbervariationestimationinexomesequencingdatausinghmm
AT halgamugesamank convexcopynumbervariationestimationinexomesequencingdatausinghmm
_version_ 1725046429123084288