Computational Complexity Of Bi-clustering

In this work we formalize a new natural objective (or cost) function for bi-clustering - Monochromatic bi-clustering. Our objective function is suitable for detecting meaningful homogenous clusters based on categorical valued input matrices. Such problems have arisen recently in systems biology wher...

Full description

Bibliographic Details
Main Author: Wulff, Sharon Jay
Language:en
Published: 2008
Subjects:
Online Access:http://hdl.handle.net/10012/3900
id ndltd-WATERLOO-oai-uwspace.uwaterloo.ca-10012-3900
record_format oai_dc
spelling ndltd-WATERLOO-oai-uwspace.uwaterloo.ca-10012-39002013-01-08T18:51:25ZWulff, Sharon Jay2008-08-26T20:42:25Z2008-08-26T20:42:25Z2008-08-26T20:42:25Z2008http://hdl.handle.net/10012/3900In this work we formalize a new natural objective (or cost) function for bi-clustering - Monochromatic bi-clustering. Our objective function is suitable for detecting meaningful homogenous clusters based on categorical valued input matrices. Such problems have arisen recently in systems biology where researchers have inferred functional classifications of biological agents based on their pairwise interactions. We analyze the computational complexity of the resulting optimization problems. We show that finding optimal solutions is NP-hard and complement this result by introducing a polynomial time approximation algorithm for this bi-clustering task. This is the first positive approximation guarantee for bi-clustering algorithms. We also show that bi-clustering with our objective function can be viewed as a generalization of correlation clustering.enBi-ClusteringNP-hardnessPolynomial time approximation scheme (PTAS)correlation clusteringComputational Complexity Of Bi-clusteringThesis or DissertationSchool of Computer ScienceMaster of ScienceComputer Science
collection NDLTD
language en
sources NDLTD
topic Bi-Clustering
NP-hardness
Polynomial time approximation scheme (PTAS)
correlation clustering
Computer Science
spellingShingle Bi-Clustering
NP-hardness
Polynomial time approximation scheme (PTAS)
correlation clustering
Computer Science
Wulff, Sharon Jay
Computational Complexity Of Bi-clustering
description In this work we formalize a new natural objective (or cost) function for bi-clustering - Monochromatic bi-clustering. Our objective function is suitable for detecting meaningful homogenous clusters based on categorical valued input matrices. Such problems have arisen recently in systems biology where researchers have inferred functional classifications of biological agents based on their pairwise interactions. We analyze the computational complexity of the resulting optimization problems. We show that finding optimal solutions is NP-hard and complement this result by introducing a polynomial time approximation algorithm for this bi-clustering task. This is the first positive approximation guarantee for bi-clustering algorithms. We also show that bi-clustering with our objective function can be viewed as a generalization of correlation clustering.
author Wulff, Sharon Jay
author_facet Wulff, Sharon Jay
author_sort Wulff, Sharon Jay
title Computational Complexity Of Bi-clustering
title_short Computational Complexity Of Bi-clustering
title_full Computational Complexity Of Bi-clustering
title_fullStr Computational Complexity Of Bi-clustering
title_full_unstemmed Computational Complexity Of Bi-clustering
title_sort computational complexity of bi-clustering
publishDate 2008
url http://hdl.handle.net/10012/3900
work_keys_str_mv AT wulffsharonjay computationalcomplexityofbiclustering
_version_ 1716573174667149312