Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data

<p>Abstract</p> <p>Background</p> <p>Clustering-based methods on gene-expression analysis have been shown to be useful in biomedical applications such as cancer subtype discovery. Among them, Matrix factorization (MF) is advantageous for clustering gene expression patte...

Full description

Bibliographic Details
Main Authors: Kim Mi, Seo Hwa, Joung Je-Gun, Kim Ju
Format: Article
Language:English
Published: BMC 2011-11-01
Series:BMC Bioinformatics
id doaj-d60218227d454ae395e3cfd98617869c
record_format Article
spelling doaj-d60218227d454ae395e3cfd98617869c2020-11-25T01:40:02ZengBMCBMC Bioinformatics1471-21052011-11-0112Suppl 13S810.1186/1471-2105-12-S13-S8Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression dataKim MiSeo HwaJoung Je-GunKim Ju<p>Abstract</p> <p>Background</p> <p>Clustering-based methods on gene-expression analysis have been shown to be useful in biomedical applications such as cancer subtype discovery. Among them, Matrix factorization (MF) is advantageous for clustering gene expression patterns from DNA microarray experiments, as it efficiently reduces the dimension of gene expression data. Although several MF methods have been proposed for clustering gene expression patterns, a systematic evaluation has not been reported yet.</p> <p>Results</p> <p>Here we evaluated the clustering performance of orthogonal and non-orthogonal MFs by a total of nine measurements for performance in four gene expression datasets and one well-known dataset for clustering. Specifically, we employed a non-orthogonal MF algorithm, BSNMF (Bi-directional Sparse Non-negative Matrix Factorization), that applies bi-directional sparseness constraints superimposed on non-negative constraints, comprising a few dominantly co-expressed genes and samples together. Non-orthogonal MFs tended to show better clustering-quality and prediction-accuracy indices than orthogonal MFs as well as a traditional method, K-means. Moreover, BSNMF showed improved performance in these measurements. Non-orthogonal MFs including BSNMF showed also good performance in the functional enrichment test using Gene Ontology terms and biological pathways.</p> <p>Conclusions</p> <p>In conclusion, the clustering performance of orthogonal and non-orthogonal MFs was appropriately evaluated for clustering microarray data by comprehensive measurements. This study showed that non-orthogonal MFs have better performance than orthogonal MFs and <it>K</it>-means for clustering microarray data.</p>
collection DOAJ
language English
format Article
sources DOAJ
author Kim Mi
Seo Hwa
Joung Je-Gun
Kim Ju
spellingShingle Kim Mi
Seo Hwa
Joung Je-Gun
Kim Ju
Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data
BMC Bioinformatics
author_facet Kim Mi
Seo Hwa
Joung Je-Gun
Kim Ju
author_sort Kim Mi
title Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data
title_short Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data
title_full Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data
title_fullStr Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data
title_full_unstemmed Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data
title_sort comprehensive evaluation of matrix factorization methods for the analysis of dna microarray gene expression data
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2011-11-01
description <p>Abstract</p> <p>Background</p> <p>Clustering-based methods on gene-expression analysis have been shown to be useful in biomedical applications such as cancer subtype discovery. Among them, Matrix factorization (MF) is advantageous for clustering gene expression patterns from DNA microarray experiments, as it efficiently reduces the dimension of gene expression data. Although several MF methods have been proposed for clustering gene expression patterns, a systematic evaluation has not been reported yet.</p> <p>Results</p> <p>Here we evaluated the clustering performance of orthogonal and non-orthogonal MFs by a total of nine measurements for performance in four gene expression datasets and one well-known dataset for clustering. Specifically, we employed a non-orthogonal MF algorithm, BSNMF (Bi-directional Sparse Non-negative Matrix Factorization), that applies bi-directional sparseness constraints superimposed on non-negative constraints, comprising a few dominantly co-expressed genes and samples together. Non-orthogonal MFs tended to show better clustering-quality and prediction-accuracy indices than orthogonal MFs as well as a traditional method, K-means. Moreover, BSNMF showed improved performance in these measurements. Non-orthogonal MFs including BSNMF showed also good performance in the functional enrichment test using Gene Ontology terms and biological pathways.</p> <p>Conclusions</p> <p>In conclusion, the clustering performance of orthogonal and non-orthogonal MFs was appropriately evaluated for clustering microarray data by comprehensive measurements. This study showed that non-orthogonal MFs have better performance than orthogonal MFs and <it>K</it>-means for clustering microarray data.</p>
work_keys_str_mv AT kimmi comprehensiveevaluationofmatrixfactorizationmethodsfortheanalysisofdnamicroarraygeneexpressiondata
AT seohwa comprehensiveevaluationofmatrixfactorizationmethodsfortheanalysisofdnamicroarraygeneexpressiondata
AT joungjegun comprehensiveevaluationofmatrixfactorizationmethodsfortheanalysisofdnamicroarraygeneexpressiondata
AT kimju comprehensiveevaluationofmatrixfactorizationmethodsfortheanalysisofdnamicroarraygeneexpressiondata
_version_ 1725047445993291776