Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure

Microarrays have revolutionized biotechnological research. The analysis of new data generated represents a computational challenge due to the characteristics of these data. Clustering techniques are applied to create groups of genes that exhibit a similar behavior. Biclustering emerges as a valuable...

Full description

Bibliographic Details
Main Authors: David Gutiérrez-Avilés, Cristina Rubio-Escudero
Format: Article
Language:English
Published: Hindawi Limited 2014-01-01
Series:The Scientific World Journal
Online Access:http://dx.doi.org/10.1155/2014/624371
id doaj-7b5432c6d78d4b4297a64adfa79e113b
record_format Article
spelling doaj-7b5432c6d78d4b4297a64adfa79e113b2020-11-25T01:40:37ZengHindawi LimitedThe Scientific World Journal2356-61401537-744X2014-01-01201410.1155/2014/624371624371Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation MeasureDavid Gutiérrez-Avilés0Cristina Rubio-Escudero1Department of Computer Science, University of Seville, Avenida Reina Mercedes s/n, 41012 Seville, SpainDepartment of Computer Science, University of Seville, Avenida Reina Mercedes s/n, 41012 Seville, SpainMicroarrays have revolutionized biotechnological research. The analysis of new data generated represents a computational challenge due to the characteristics of these data. Clustering techniques are applied to create groups of genes that exhibit a similar behavior. Biclustering emerges as a valuable tool for microarray data analysis since it relaxes the constraints for grouping, allowing genes to be evaluated only under a subset of the conditions. However, if a third dimension appears in the data, triclustering is the appropriate tool for the analysis. This occurs in longitudinal experiments in which the genes are evaluated under conditions at several time points. All clustering, biclustering, and triclustering techniques guide their search for solutions by a measure that evaluates the quality of clusters. We present an evaluation measure for triclusters called Mean Square Residue 3D. This measure is based on the classic biclustering measure Mean Square Residue. Mean Square Residue 3D has been applied to both synthetic and real data and it has proved to be capable of extracting groups of genes with homogeneous patterns in subsets of conditions and times, and these groups have shown a high correlation level and they are also related to their functional annotations extracted from the Gene Ontology project.http://dx.doi.org/10.1155/2014/624371
collection DOAJ
language English
format Article
sources DOAJ
author David Gutiérrez-Avilés
Cristina Rubio-Escudero
spellingShingle David Gutiérrez-Avilés
Cristina Rubio-Escudero
Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure
The Scientific World Journal
author_facet David Gutiérrez-Avilés
Cristina Rubio-Escudero
author_sort David Gutiérrez-Avilés
title Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure
title_short Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure
title_full Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure
title_fullStr Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure
title_full_unstemmed Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure
title_sort mining 3d patterns from gene expression temporal data: a new tricluster evaluation measure
publisher Hindawi Limited
series The Scientific World Journal
issn 2356-6140
1537-744X
publishDate 2014-01-01
description Microarrays have revolutionized biotechnological research. The analysis of new data generated represents a computational challenge due to the characteristics of these data. Clustering techniques are applied to create groups of genes that exhibit a similar behavior. Biclustering emerges as a valuable tool for microarray data analysis since it relaxes the constraints for grouping, allowing genes to be evaluated only under a subset of the conditions. However, if a third dimension appears in the data, triclustering is the appropriate tool for the analysis. This occurs in longitudinal experiments in which the genes are evaluated under conditions at several time points. All clustering, biclustering, and triclustering techniques guide their search for solutions by a measure that evaluates the quality of clusters. We present an evaluation measure for triclusters called Mean Square Residue 3D. This measure is based on the classic biclustering measure Mean Square Residue. Mean Square Residue 3D has been applied to both synthetic and real data and it has proved to be capable of extracting groups of genes with homogeneous patterns in subsets of conditions and times, and these groups have shown a high correlation level and they are also related to their functional annotations extracted from the Gene Ontology project.
url http://dx.doi.org/10.1155/2014/624371
work_keys_str_mv AT davidgutierrezaviles mining3dpatternsfromgeneexpressiontemporaldataanewtriclusterevaluationmeasure
AT cristinarubioescudero mining3dpatternsfromgeneexpressiontemporaldataanewtriclusterevaluationmeasure
_version_ 1725044641508622336