Cluster stability scores for microarray data in cancer studies

Abstract Background A potential benefit of profiling of tissue samples using microarrays is the generation of molecular fingerprints that will define subtypes of disease. Hierarchical clustering has been the primary analytical tool used to define diseas...

Full description

Bibliographic Details
Main Authors:	Ghosh Debashis, Smolkin Mark
Format:	Article
Language:	English
Published:	BMC 2003-09-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/4/36

id	doaj-ca0d4ce246644696bf0368a883ca7203
record_format	Article
spelling	doaj-ca0d4ce246644696bf0368a883ca72032020-11-24T23:04:56ZengBMCBMC Bioinformatics1471-21052003-09-01413610.1186/1471-2105-4-36Cluster stability scores for microarray data in cancer studiesGhosh DebashisSmolkin Mark<p>Abstract</p> <p>Background</p> <p>A potential benefit of profiling of tissue samples using microarrays is the generation of molecular fingerprints that will define subtypes of disease. Hierarchical clustering has been the primary analytical tool used to define disease subtypes from microarray experiments in cancer settings. Assessing cluster reliability poses a major complication in analyzing output from clustering procedures. While most work has focused on estimating the number of clusters in a dataset, the question of stability of individual-level clusters has not been addressed.</p> <p>Results</p> <p>We address this problem by developing cluster stability scores using subsampling techniques. These scores exploit the redundancy in biologically discriminatory information on the chip. Our approach is generic and can be used with any clustering method. We propose procedures for calculating cluster stability scores for situations involving both known and unknown numbers of clusters. We also develop cluster-size adjusted stability scores. The method is illustrated by application to data three cancer studies; one involving childhood cancers, the second involving B-cell lymphoma, and the final is from a malignant melanoma study.</p> <p>Availability</p> <p>Code implementing the proposed analytic method can be obtained at the second author's website.</p> http://www.biomedcentral.com/1471-2105/4/36
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Ghosh Debashis Smolkin Mark
spellingShingle	Ghosh Debashis Smolkin Mark Cluster stability scores for microarray data in cancer studies BMC Bioinformatics
author_facet	Ghosh Debashis Smolkin Mark
author_sort	Ghosh Debashis
title	Cluster stability scores for microarray data in cancer studies
title_short	Cluster stability scores for microarray data in cancer studies
title_full	Cluster stability scores for microarray data in cancer studies
title_fullStr	Cluster stability scores for microarray data in cancer studies
title_full_unstemmed	Cluster stability scores for microarray data in cancer studies
title_sort	cluster stability scores for microarray data in cancer studies
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2003-09-01
description	<p>Abstract</p> <p>Background</p> <p>A potential benefit of profiling of tissue samples using microarrays is the generation of molecular fingerprints that will define subtypes of disease. Hierarchical clustering has been the primary analytical tool used to define disease subtypes from microarray experiments in cancer settings. Assessing cluster reliability poses a major complication in analyzing output from clustering procedures. While most work has focused on estimating the number of clusters in a dataset, the question of stability of individual-level clusters has not been addressed.</p> <p>Results</p> <p>We address this problem by developing cluster stability scores using subsampling techniques. These scores exploit the redundancy in biologically discriminatory information on the chip. Our approach is generic and can be used with any clustering method. We propose procedures for calculating cluster stability scores for situations involving both known and unknown numbers of clusters. We also develop cluster-size adjusted stability scores. The method is illustrated by application to data three cancer studies; one involving childhood cancers, the second involving B-cell lymphoma, and the final is from a malignant melanoma study.</p> <p>Availability</p> <p>Code implementing the proposed analytic method can be obtained at the second author's website.</p>
url	http://www.biomedcentral.com/1471-2105/4/36
work_keys_str_mv	AT ghoshdebashis clusterstabilityscoresformicroarraydataincancerstudies AT smolkinmark clusterstabilityscoresformicroarraydataincancerstudies
_version_	1725628458638245888

Cluster stability scores for microarray data in cancer studies

Similar Items