A framework for regularized non-negative matrix factorization, with application to the analysis of gene expression data.

Non-negative matrix factorization (NMF) condenses high-dimensional data into lower-dimensional models subject to the requirement that data can only be added, never subtracted. However, the NMF problem does not have a unique solution, creating a need for additional constraints (regularization constra...

Full description

Bibliographic Details
Main Authors:	Leo Taslaman, Björn Nilsson
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2012-01-01
Series:	PLoS ONE
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23133590/?tool=EBI

id	doaj-801bcbbcc3384ae3a416ec9c3e844900
record_format	Article
spelling	doaj-801bcbbcc3384ae3a416ec9c3e8449002021-03-04T00:07:13ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-01711e4633110.1371/journal.pone.0046331A framework for regularized non-negative matrix factorization, with application to the analysis of gene expression data.Leo TaslamanBjörn NilssonNon-negative matrix factorization (NMF) condenses high-dimensional data into lower-dimensional models subject to the requirement that data can only be added, never subtracted. However, the NMF problem does not have a unique solution, creating a need for additional constraints (regularization constraints) to promote informative solutions. Regularized NMF problems are more complicated than conventional NMF problems, creating a need for computational methods that incorporate the extra constraints in a reliable way. We developed novel methods for regularized NMF based on block-coordinate descent with proximal point modification and a fast optimization procedure over the alpha simplex. Our framework has important advantages in that it (a) accommodates for a wide range of regularization terms, including sparsity-inducing terms like the L1 penalty, (b) guarantees that the solutions satisfy necessary conditions for optimality, ensuring that the results have well-defined numerical meaning, (c) allows the scale of the solution to be controlled exactly, and (d) is computationally efficient. We illustrate the use of our approach on in the context of gene expression microarray data analysis. The improvements described remedy key limitations of previous proposals, strengthen the theoretical basis of regularized NMF, and facilitate the use of regularized NMF in applications.https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23133590/?tool=EBI
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Leo Taslaman Björn Nilsson
spellingShingle	Leo Taslaman Björn Nilsson A framework for regularized non-negative matrix factorization, with application to the analysis of gene expression data. PLoS ONE
author_facet	Leo Taslaman Björn Nilsson
author_sort	Leo Taslaman
title	A framework for regularized non-negative matrix factorization, with application to the analysis of gene expression data.
title_short	A framework for regularized non-negative matrix factorization, with application to the analysis of gene expression data.
title_full	A framework for regularized non-negative matrix factorization, with application to the analysis of gene expression data.
title_fullStr	A framework for regularized non-negative matrix factorization, with application to the analysis of gene expression data.
title_full_unstemmed	A framework for regularized non-negative matrix factorization, with application to the analysis of gene expression data.
title_sort	framework for regularized non-negative matrix factorization, with application to the analysis of gene expression data.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2012-01-01
description	Non-negative matrix factorization (NMF) condenses high-dimensional data into lower-dimensional models subject to the requirement that data can only be added, never subtracted. However, the NMF problem does not have a unique solution, creating a need for additional constraints (regularization constraints) to promote informative solutions. Regularized NMF problems are more complicated than conventional NMF problems, creating a need for computational methods that incorporate the extra constraints in a reliable way. We developed novel methods for regularized NMF based on block-coordinate descent with proximal point modification and a fast optimization procedure over the alpha simplex. Our framework has important advantages in that it (a) accommodates for a wide range of regularization terms, including sparsity-inducing terms like the L1 penalty, (b) guarantees that the solutions satisfy necessary conditions for optimality, ensuring that the results have well-defined numerical meaning, (c) allows the scale of the solution to be controlled exactly, and (d) is computationally efficient. We illustrate the use of our approach on in the context of gene expression microarray data analysis. The improvements described remedy key limitations of previous proposals, strengthen the theoretical basis of regularized NMF, and facilitate the use of regularized NMF in applications.
url	https://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23133590/?tool=EBI
work_keys_str_mv	AT leotaslaman aframeworkforregularizednonnegativematrixfactorizationwithapplicationtotheanalysisofgeneexpressiondata AT bjornnilsson aframeworkforregularizednonnegativematrixfactorizationwithapplicationtotheanalysisofgeneexpressiondata AT leotaslaman frameworkforregularizednonnegativematrixfactorizationwithapplicationtotheanalysisofgeneexpressiondata AT bjornnilsson frameworkforregularizednonnegativematrixfactorizationwithapplicationtotheanalysisofgeneexpressiondata
_version_	1714810699694211072

A framework for regularized non-negative matrix factorization, with application to the analysis of gene expression data.

Similar Items