CoNVEX: copy number variation estimation in exome sequencing data using HMM
<p>Abstract</p> <p>Background</p> <p>One of the main types of genetic variations in cancer is Copy Number Variations (CNV). Whole exome sequenicng (WES) is a popular alternative to whole genome sequencing (WGS) to study disease specific genomic variations. However, find...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2013-01-01
|
Series: | BMC Bioinformatics |
Subjects: |
id |
doaj-f773624b614648fe81cddd3e15be458a |
---|---|
record_format |
Article |
spelling |
doaj-f773624b614648fe81cddd3e15be458a2020-11-25T01:40:12ZengBMCBMC Bioinformatics1471-21052013-01-0114Suppl 2S210.1186/1471-2105-14-S2-S2CoNVEX: copy number variation estimation in exome sequencing data using HMMAmarasinghe Kaushalya CLi JasonHalgamuge Saman K<p>Abstract</p> <p>Background</p> <p>One of the main types of genetic variations in cancer is Copy Number Variations (CNV). Whole exome sequenicng (WES) is a popular alternative to whole genome sequencing (WGS) to study disease specific genomic variations. However, finding CNV in Cancer samples using WES data has not been fully explored.</p> <p>Results</p> <p>We present a new method, called CoNVEX, to estimate copy number variation in whole exome sequencing data. It uses ratio of tumour and matched normal average read depths at each exonic region, to predict the copy gain or loss. The useful signal produced by WES data will be hindered by the intrinsic noise present in the data itself. This limits its capacity to be used as a highly reliable CNV detection source. Here, we propose a method that consists of discrete wavelet transform (DWT) to reduce noise. The identification of copy number gains/losses of each targeted region is performed by a Hidden Markov Model (HMM).</p> <p>Conclusion</p> <p>HMM is frequently used to identify CNV in data produced by various technologies including Array Comparative Genomic Hybridization (aCGH) and WGS. Here, we propose an HMM to detect CNV in cancer exome data. We used modified data from 1000 Genomes project to evaluate the performance of the proposed method. Using these data we have shown that CoNVEX outperforms the existing methods significantly in terms of precision. Overall, CoNVEX achieved a sensitivity of more than 92% and a precision of more than 50%.</p> CNV detectionCancer GenomeTargeted resequencingWhole exome sequencingHidden Markov ModelsDiscrete Wavelet Transform |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Amarasinghe Kaushalya C Li Jason Halgamuge Saman K |
spellingShingle |
Amarasinghe Kaushalya C Li Jason Halgamuge Saman K CoNVEX: copy number variation estimation in exome sequencing data using HMM BMC Bioinformatics CNV detection Cancer Genome Targeted resequencing Whole exome sequencing Hidden Markov Models Discrete Wavelet Transform |
author_facet |
Amarasinghe Kaushalya C Li Jason Halgamuge Saman K |
author_sort |
Amarasinghe Kaushalya C |
title |
CoNVEX: copy number variation estimation in exome sequencing data using HMM |
title_short |
CoNVEX: copy number variation estimation in exome sequencing data using HMM |
title_full |
CoNVEX: copy number variation estimation in exome sequencing data using HMM |
title_fullStr |
CoNVEX: copy number variation estimation in exome sequencing data using HMM |
title_full_unstemmed |
CoNVEX: copy number variation estimation in exome sequencing data using HMM |
title_sort |
convex: copy number variation estimation in exome sequencing data using hmm |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2013-01-01 |
description |
<p>Abstract</p> <p>Background</p> <p>One of the main types of genetic variations in cancer is Copy Number Variations (CNV). Whole exome sequenicng (WES) is a popular alternative to whole genome sequencing (WGS) to study disease specific genomic variations. However, finding CNV in Cancer samples using WES data has not been fully explored.</p> <p>Results</p> <p>We present a new method, called CoNVEX, to estimate copy number variation in whole exome sequencing data. It uses ratio of tumour and matched normal average read depths at each exonic region, to predict the copy gain or loss. The useful signal produced by WES data will be hindered by the intrinsic noise present in the data itself. This limits its capacity to be used as a highly reliable CNV detection source. Here, we propose a method that consists of discrete wavelet transform (DWT) to reduce noise. The identification of copy number gains/losses of each targeted region is performed by a Hidden Markov Model (HMM).</p> <p>Conclusion</p> <p>HMM is frequently used to identify CNV in data produced by various technologies including Array Comparative Genomic Hybridization (aCGH) and WGS. Here, we propose an HMM to detect CNV in cancer exome data. We used modified data from 1000 Genomes project to evaluate the performance of the proposed method. Using these data we have shown that CoNVEX outperforms the existing methods significantly in terms of precision. Overall, CoNVEX achieved a sensitivity of more than 92% and a precision of more than 50%.</p> |
topic |
CNV detection Cancer Genome Targeted resequencing Whole exome sequencing Hidden Markov Models Discrete Wavelet Transform |
work_keys_str_mv |
AT amarasinghekaushalyac convexcopynumbervariationestimationinexomesequencingdatausinghmm AT lijason convexcopynumbervariationestimationinexomesequencingdatausinghmm AT halgamugesamank convexcopynumbervariationestimationinexomesequencingdatausinghmm |
_version_ |
1725046429123084288 |