A temporal precedence based clustering method for gene expression microarray data

<p>Abstract</p> <p>Background</p> <p>Time-course microarray experiments can produce useful data which can help in understanding the underlying dynamics of the system. Clustering is an important stage in microarray data analysis where the data is grouped together accordi...

Full description

Bibliographic Details
Main Authors: Buchanan-Wollaston Vicky, Li Chang-Tsun, Krishna Ritesh
Format: Article
Language:English
Published: BMC 2010-01-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/11/68
id doaj-74a8d79098eb4c63875dc4fc883fbd11
record_format Article
spelling doaj-74a8d79098eb4c63875dc4fc883fbd112020-11-25T01:18:23ZengBMCBMC Bioinformatics1471-21052010-01-011116810.1186/1471-2105-11-68A temporal precedence based clustering method for gene expression microarray dataBuchanan-Wollaston VickyLi Chang-TsunKrishna Ritesh<p>Abstract</p> <p>Background</p> <p>Time-course microarray experiments can produce useful data which can help in understanding the underlying dynamics of the system. Clustering is an important stage in microarray data analysis where the data is grouped together according to certain characteristics. The majority of clustering techniques are based on distance or visual similarity measures which may not be suitable for clustering of temporal microarray data where the sequential nature of time is important. We present a Granger causality based technique to cluster temporal microarray gene expression data, which measures the interdependence between two time-series by statistically testing if one time-series can be used for forecasting the other time-series or not.</p> <p>Results</p> <p>A gene-association matrix is constructed by testing temporal relationships between pairs of genes using the Granger causality test. The association matrix is further analyzed using a graph-theoretic technique to detect highly connected components representing interesting biological modules. We test our approach on synthesized datasets and real biological datasets obtained for Arabidopsis thaliana. We show the effectiveness of our approach by analyzing the results using the existing biological literature. We also report interesting structural properties of the association network commonly desired in any biological system.</p> <p>Conclusions</p> <p>Our experiments on synthesized and real microarray datasets show that our approach produces encouraging results. The method is simple in implementation and is statistically traceable at each step. The method can produce sets of functionally related genes which can be further used for reverse-engineering of gene circuits.</p> http://www.biomedcentral.com/1471-2105/11/68
collection DOAJ
language English
format Article
sources DOAJ
author Buchanan-Wollaston Vicky
Li Chang-Tsun
Krishna Ritesh
spellingShingle Buchanan-Wollaston Vicky
Li Chang-Tsun
Krishna Ritesh
A temporal precedence based clustering method for gene expression microarray data
BMC Bioinformatics
author_facet Buchanan-Wollaston Vicky
Li Chang-Tsun
Krishna Ritesh
author_sort Buchanan-Wollaston Vicky
title A temporal precedence based clustering method for gene expression microarray data
title_short A temporal precedence based clustering method for gene expression microarray data
title_full A temporal precedence based clustering method for gene expression microarray data
title_fullStr A temporal precedence based clustering method for gene expression microarray data
title_full_unstemmed A temporal precedence based clustering method for gene expression microarray data
title_sort temporal precedence based clustering method for gene expression microarray data
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2010-01-01
description <p>Abstract</p> <p>Background</p> <p>Time-course microarray experiments can produce useful data which can help in understanding the underlying dynamics of the system. Clustering is an important stage in microarray data analysis where the data is grouped together according to certain characteristics. The majority of clustering techniques are based on distance or visual similarity measures which may not be suitable for clustering of temporal microarray data where the sequential nature of time is important. We present a Granger causality based technique to cluster temporal microarray gene expression data, which measures the interdependence between two time-series by statistically testing if one time-series can be used for forecasting the other time-series or not.</p> <p>Results</p> <p>A gene-association matrix is constructed by testing temporal relationships between pairs of genes using the Granger causality test. The association matrix is further analyzed using a graph-theoretic technique to detect highly connected components representing interesting biological modules. We test our approach on synthesized datasets and real biological datasets obtained for Arabidopsis thaliana. We show the effectiveness of our approach by analyzing the results using the existing biological literature. We also report interesting structural properties of the association network commonly desired in any biological system.</p> <p>Conclusions</p> <p>Our experiments on synthesized and real microarray datasets show that our approach produces encouraging results. The method is simple in implementation and is statistically traceable at each step. The method can produce sets of functionally related genes which can be further used for reverse-engineering of gene circuits.</p>
url http://www.biomedcentral.com/1471-2105/11/68
work_keys_str_mv AT buchananwollastonvicky atemporalprecedencebasedclusteringmethodforgeneexpressionmicroarraydata
AT lichangtsun atemporalprecedencebasedclusteringmethodforgeneexpressionmicroarraydata
AT krishnaritesh atemporalprecedencebasedclusteringmethodforgeneexpressionmicroarraydata
AT buchananwollastonvicky temporalprecedencebasedclusteringmethodforgeneexpressionmicroarraydata
AT lichangtsun temporalprecedencebasedclusteringmethodforgeneexpressionmicroarraydata
AT krishnaritesh temporalprecedencebasedclusteringmethodforgeneexpressionmicroarraydata
_version_ 1725142899150028800