Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure
Microarrays have revolutionized biotechnological research. The analysis of new data generated represents a computational challenge due to the characteristics of these data. Clustering techniques are applied to create groups of genes that exhibit a similar behavior. Biclustering emerges as a valuable...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2014-01-01
|
Series: | The Scientific World Journal |
Online Access: | http://dx.doi.org/10.1155/2014/624371 |
id |
doaj-7b5432c6d78d4b4297a64adfa79e113b |
---|---|
record_format |
Article |
spelling |
doaj-7b5432c6d78d4b4297a64adfa79e113b2020-11-25T01:40:37ZengHindawi LimitedThe Scientific World Journal2356-61401537-744X2014-01-01201410.1155/2014/624371624371Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation MeasureDavid Gutiérrez-Avilés0Cristina Rubio-Escudero1Department of Computer Science, University of Seville, Avenida Reina Mercedes s/n, 41012 Seville, SpainDepartment of Computer Science, University of Seville, Avenida Reina Mercedes s/n, 41012 Seville, SpainMicroarrays have revolutionized biotechnological research. The analysis of new data generated represents a computational challenge due to the characteristics of these data. Clustering techniques are applied to create groups of genes that exhibit a similar behavior. Biclustering emerges as a valuable tool for microarray data analysis since it relaxes the constraints for grouping, allowing genes to be evaluated only under a subset of the conditions. However, if a third dimension appears in the data, triclustering is the appropriate tool for the analysis. This occurs in longitudinal experiments in which the genes are evaluated under conditions at several time points. All clustering, biclustering, and triclustering techniques guide their search for solutions by a measure that evaluates the quality of clusters. We present an evaluation measure for triclusters called Mean Square Residue 3D. This measure is based on the classic biclustering measure Mean Square Residue. Mean Square Residue 3D has been applied to both synthetic and real data and it has proved to be capable of extracting groups of genes with homogeneous patterns in subsets of conditions and times, and these groups have shown a high correlation level and they are also related to their functional annotations extracted from the Gene Ontology project.http://dx.doi.org/10.1155/2014/624371 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
David Gutiérrez-Avilés Cristina Rubio-Escudero |
spellingShingle |
David Gutiérrez-Avilés Cristina Rubio-Escudero Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure The Scientific World Journal |
author_facet |
David Gutiérrez-Avilés Cristina Rubio-Escudero |
author_sort |
David Gutiérrez-Avilés |
title |
Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure |
title_short |
Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure |
title_full |
Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure |
title_fullStr |
Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure |
title_full_unstemmed |
Mining 3D Patterns from Gene Expression Temporal Data: A New Tricluster Evaluation Measure |
title_sort |
mining 3d patterns from gene expression temporal data: a new tricluster evaluation measure |
publisher |
Hindawi Limited |
series |
The Scientific World Journal |
issn |
2356-6140 1537-744X |
publishDate |
2014-01-01 |
description |
Microarrays have revolutionized biotechnological research. The analysis of new data generated represents a computational challenge due to the characteristics of these data. Clustering techniques are applied to create groups of genes that exhibit a similar behavior. Biclustering emerges as a valuable tool for microarray data analysis since it relaxes the constraints for grouping, allowing genes to be evaluated only under a subset of the conditions. However, if a third dimension appears in the data, triclustering is the appropriate tool for the analysis. This occurs in longitudinal experiments in which the genes are evaluated under conditions at several time points. All clustering, biclustering, and triclustering techniques guide their search for solutions by a measure that evaluates the quality of clusters. We present an evaluation measure for triclusters called Mean Square Residue 3D. This measure is based on the classic biclustering measure Mean Square Residue. Mean Square Residue 3D has been applied to both synthetic and real data and it has proved to be capable of extracting groups of genes with homogeneous patterns in subsets of conditions and times, and these groups have shown a high correlation level and they are also related to their functional annotations extracted from the Gene Ontology project. |
url |
http://dx.doi.org/10.1155/2014/624371 |
work_keys_str_mv |
AT davidgutierrezaviles mining3dpatternsfromgeneexpressiontemporaldataanewtriclusterevaluationmeasure AT cristinarubioescudero mining3dpatternsfromgeneexpressiontemporaldataanewtriclusterevaluationmeasure |
_version_ |
1725044641508622336 |