Mining biological information from 3D short time-series gene expression data: the OPTricluster algorithm
<p>Abstract</p> <p>Background</p> <p>Nowadays, it is possible to collect expression levels of a set of genes from a set of biological samples during a series of time points. Such data have three dimensions: gene-sample-time (GST). Thus they are called 3D microarray gene...
Main Authors: | , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2012-04-01
|
Series: | BMC Bioinformatics |
Online Access: | http://www.biomedcentral.com/1471-2105/13/54 |
id |
doaj-cbe42a3e17e540df90d88a27a5f92511 |
---|---|
record_format |
Article |
spelling |
doaj-cbe42a3e17e540df90d88a27a5f925112020-11-24T23:04:56ZengBMCBMC Bioinformatics1471-21052012-04-011315410.1186/1471-2105-13-54Mining biological information from 3D short time-series gene expression data: the OPTricluster algorithmTchagang Alain BPhan SieuFamili FazelShearer HeatherFobert PierreHuang YiZou JitaoHuang DaiqingCutler AdrianLiu ZiyingPan Youlian<p>Abstract</p> <p>Background</p> <p>Nowadays, it is possible to collect expression levels of a set of genes from a set of biological samples during a series of time points. Such data have three dimensions: gene-sample-time (GST). Thus they are called 3D microarray gene expression data. To take advantage of the 3D data collected, and to fully understand the biological knowledge hidden in the GST data, novel subspace clustering algorithms have to be developed to effectively address the biological problem in the corresponding space.</p> <p>Results</p> <p>We developed a subspace clustering algorithm called Order Preserving Triclustering (OPTricluster), for 3D short time-series data mining. OPTricluster is able to identify 3D clusters with coherent evolution from a given 3D dataset using a combinatorial approach on the sample dimension, and the order preserving (OP) concept on the time dimension. The fusion of the two methodologies allows one to study similarities and differences between samples in terms of their temporal expression profile. OPTricluster has been successfully applied to four case studies: immune response in mice infected by malaria (<it>Plasmodium chabaudi</it>), systemic acquired resistance in <it>Arabidopsis thaliana</it>, similarities and differences between inner and outer cotyledon in <it>Brassica napus </it>during seed development, and to <it>Brassica napus </it>whole seed development. These studies showed that OPTricluster is robust to noise and is able to detect the similarities and differences between biological samples.</p> <p>Conclusions</p> <p>Our analysis showed that OPTricluster generally outperforms other well known clustering algorithms such as the TRICLUSTER, gTRICLUSTER and K-means; it is robust to noise and can effectively mine the biological knowledge hidden in the 3D short time-series gene expression data.</p> http://www.biomedcentral.com/1471-2105/13/54 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Tchagang Alain B Phan Sieu Famili Fazel Shearer Heather Fobert Pierre Huang Yi Zou Jitao Huang Daiqing Cutler Adrian Liu Ziying Pan Youlian |
spellingShingle |
Tchagang Alain B Phan Sieu Famili Fazel Shearer Heather Fobert Pierre Huang Yi Zou Jitao Huang Daiqing Cutler Adrian Liu Ziying Pan Youlian Mining biological information from 3D short time-series gene expression data: the OPTricluster algorithm BMC Bioinformatics |
author_facet |
Tchagang Alain B Phan Sieu Famili Fazel Shearer Heather Fobert Pierre Huang Yi Zou Jitao Huang Daiqing Cutler Adrian Liu Ziying Pan Youlian |
author_sort |
Tchagang Alain B |
title |
Mining biological information from 3D short time-series gene expression data: the OPTricluster algorithm |
title_short |
Mining biological information from 3D short time-series gene expression data: the OPTricluster algorithm |
title_full |
Mining biological information from 3D short time-series gene expression data: the OPTricluster algorithm |
title_fullStr |
Mining biological information from 3D short time-series gene expression data: the OPTricluster algorithm |
title_full_unstemmed |
Mining biological information from 3D short time-series gene expression data: the OPTricluster algorithm |
title_sort |
mining biological information from 3d short time-series gene expression data: the optricluster algorithm |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2012-04-01 |
description |
<p>Abstract</p> <p>Background</p> <p>Nowadays, it is possible to collect expression levels of a set of genes from a set of biological samples during a series of time points. Such data have three dimensions: gene-sample-time (GST). Thus they are called 3D microarray gene expression data. To take advantage of the 3D data collected, and to fully understand the biological knowledge hidden in the GST data, novel subspace clustering algorithms have to be developed to effectively address the biological problem in the corresponding space.</p> <p>Results</p> <p>We developed a subspace clustering algorithm called Order Preserving Triclustering (OPTricluster), for 3D short time-series data mining. OPTricluster is able to identify 3D clusters with coherent evolution from a given 3D dataset using a combinatorial approach on the sample dimension, and the order preserving (OP) concept on the time dimension. The fusion of the two methodologies allows one to study similarities and differences between samples in terms of their temporal expression profile. OPTricluster has been successfully applied to four case studies: immune response in mice infected by malaria (<it>Plasmodium chabaudi</it>), systemic acquired resistance in <it>Arabidopsis thaliana</it>, similarities and differences between inner and outer cotyledon in <it>Brassica napus </it>during seed development, and to <it>Brassica napus </it>whole seed development. These studies showed that OPTricluster is robust to noise and is able to detect the similarities and differences between biological samples.</p> <p>Conclusions</p> <p>Our analysis showed that OPTricluster generally outperforms other well known clustering algorithms such as the TRICLUSTER, gTRICLUSTER and K-means; it is robust to noise and can effectively mine the biological knowledge hidden in the 3D short time-series gene expression data.</p> |
url |
http://www.biomedcentral.com/1471-2105/13/54 |
work_keys_str_mv |
AT tchagangalainb miningbiologicalinformationfrom3dshorttimeseriesgeneexpressiondatatheoptriclusteralgorithm AT phansieu miningbiologicalinformationfrom3dshorttimeseriesgeneexpressiondatatheoptriclusteralgorithm AT familifazel miningbiologicalinformationfrom3dshorttimeseriesgeneexpressiondatatheoptriclusteralgorithm AT shearerheather miningbiologicalinformationfrom3dshorttimeseriesgeneexpressiondatatheoptriclusteralgorithm AT fobertpierre miningbiologicalinformationfrom3dshorttimeseriesgeneexpressiondatatheoptriclusteralgorithm AT huangyi miningbiologicalinformationfrom3dshorttimeseriesgeneexpressiondatatheoptriclusteralgorithm AT zoujitao miningbiologicalinformationfrom3dshorttimeseriesgeneexpressiondatatheoptriclusteralgorithm AT huangdaiqing miningbiologicalinformationfrom3dshorttimeseriesgeneexpressiondatatheoptriclusteralgorithm AT cutleradrian miningbiologicalinformationfrom3dshorttimeseriesgeneexpressiondatatheoptriclusteralgorithm AT liuziying miningbiologicalinformationfrom3dshorttimeseriesgeneexpressiondatatheoptriclusteralgorithm AT panyoulian miningbiologicalinformationfrom3dshorttimeseriesgeneexpressiondatatheoptriclusteralgorithm |
_version_ |
1725628302205386752 |