A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data

<p>Abstract</p> <p>Background</p> <p>The recent advancement in array CGH (aCGH) research has significantly improved tumor identification using DNA copy number data. A number of unsupervised learning methods have been proposed for clustering aCGH samples. Two of the majo...

Full description

Bibliographic Details
Main Authors: Zhang Ke, Yang Yi, Devanarayan Viswanath, Xie Linglin, Deng Youping, Donald Sens
Format: Article
Language:English
Published: BMC 2011-12-01
Series:BMC Genomics
Online Access:http://www.biomedcentral.com/1471-2164/12/S5/S10
id doaj-5bc64de18ac74166ac2b71109ce54be5
record_format Article
spelling doaj-5bc64de18ac74166ac2b71109ce54be52020-11-25T00:25:44ZengBMCBMC Genomics1471-21642011-12-0112Suppl 5S1010.1186/1471-2164-12-S5-S10A hidden Markov model-based algorithm for identifying tumour subtype using array CGH dataZhang KeYang YiDevanarayan ViswanathXie LinglinDeng YoupingDonald Sens<p>Abstract</p> <p>Background</p> <p>The recent advancement in array CGH (aCGH) research has significantly improved tumor identification using DNA copy number data. A number of unsupervised learning methods have been proposed for clustering aCGH samples. Two of the major challenges for developing aCGH sample clustering are the high spatial correlation between aCGH markers and the low computing efficiency. A mixture hidden Markov model based algorithm was developed to address these two challenges.</p> <p>Results</p> <p>The hidden Markov model (HMM) was used to model the spatial correlation between aCGH markers. A fast clustering algorithm was implemented and real data analysis on glioma aCGH data has shown that it converges to the optimal cluster rapidly and the computation time is proportional to the sample size. Simulation results showed that this HMM based clustering (HMMC) method has a substantially lower error rate than NMF clustering. The HMMC results for glioma data were significantly associated with clinical outcomes.</p> <p>Conclusions</p> <p>We have developed a fast clustering algorithm to identify tumor subtypes based on DNA copy number aberrations. The performance of the proposed HMMC method has been evaluated using both simulated and real aCGH data. The software for HMMC in both R and C++ is available in ND INBRE website <url>http://ndinbre.org/programs/bioinformatics.php.</url></p> http://www.biomedcentral.com/1471-2164/12/S5/S10
collection DOAJ
language English
format Article
sources DOAJ
author Zhang Ke
Yang Yi
Devanarayan Viswanath
Xie Linglin
Deng Youping
Donald Sens
spellingShingle Zhang Ke
Yang Yi
Devanarayan Viswanath
Xie Linglin
Deng Youping
Donald Sens
A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data
BMC Genomics
author_facet Zhang Ke
Yang Yi
Devanarayan Viswanath
Xie Linglin
Deng Youping
Donald Sens
author_sort Zhang Ke
title A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data
title_short A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data
title_full A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data
title_fullStr A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data
title_full_unstemmed A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data
title_sort hidden markov model-based algorithm for identifying tumour subtype using array cgh data
publisher BMC
series BMC Genomics
issn 1471-2164
publishDate 2011-12-01
description <p>Abstract</p> <p>Background</p> <p>The recent advancement in array CGH (aCGH) research has significantly improved tumor identification using DNA copy number data. A number of unsupervised learning methods have been proposed for clustering aCGH samples. Two of the major challenges for developing aCGH sample clustering are the high spatial correlation between aCGH markers and the low computing efficiency. A mixture hidden Markov model based algorithm was developed to address these two challenges.</p> <p>Results</p> <p>The hidden Markov model (HMM) was used to model the spatial correlation between aCGH markers. A fast clustering algorithm was implemented and real data analysis on glioma aCGH data has shown that it converges to the optimal cluster rapidly and the computation time is proportional to the sample size. Simulation results showed that this HMM based clustering (HMMC) method has a substantially lower error rate than NMF clustering. The HMMC results for glioma data were significantly associated with clinical outcomes.</p> <p>Conclusions</p> <p>We have developed a fast clustering algorithm to identify tumor subtypes based on DNA copy number aberrations. The performance of the proposed HMMC method has been evaluated using both simulated and real aCGH data. The software for HMMC in both R and C++ is available in ND INBRE website <url>http://ndinbre.org/programs/bioinformatics.php.</url></p>
url http://www.biomedcentral.com/1471-2164/12/S5/S10
work_keys_str_mv AT zhangke ahiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT yangyi ahiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT devanarayanviswanath ahiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT xielinglin ahiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT dengyouping ahiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT donaldsens ahiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT zhangke hiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT yangyi hiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT devanarayanviswanath hiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT xielinglin hiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT dengyouping hiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT donaldsens hiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
_version_ 1725347218738642944