A Simple Approach to Ranking Differentially Expressed Gene Expression Time Courses through Gaussian Process Regression

<p>Abstract</p> <p>Background</p> <p>The analysis of gene expression from time series underpins many biological studies. Two basic forms of analysis recur for data of this type: removing inactive (quiet) genes from the study and determining which genes are differentiall...

Full description

Bibliographic Details
Main Authors: Lawrence Neil D, Kalaitzis Alfredo A
Format: Article
Language:English
Published: BMC 2011-05-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/12/180
id doaj-acef37ad99f74eb08836cd85f02e9f20
record_format Article
spelling doaj-acef37ad99f74eb08836cd85f02e9f202020-11-24T21:16:05ZengBMCBMC Bioinformatics1471-21052011-05-0112118010.1186/1471-2105-12-180A Simple Approach to Ranking Differentially Expressed Gene Expression Time Courses through Gaussian Process RegressionLawrence Neil DKalaitzis Alfredo A<p>Abstract</p> <p>Background</p> <p>The analysis of gene expression from time series underpins many biological studies. Two basic forms of analysis recur for data of this type: removing inactive (quiet) genes from the study and determining which genes are differentially expressed. Often these analysis stages are applied disregarding the fact that the data is drawn from a time series. In this paper we propose a simple model for accounting for the underlying temporal nature of the data based on a Gaussian process.</p> <p>Results</p> <p>We review Gaussian process (GP) regression for estimating the continuous trajectories underlying in gene expression time-series. We present a simple approach which can be used to filter quiet genes, or for the case of time series in the form of expression ratios, quantify differential expression. We assess via ROC curves the rankings produced by our regression framework and compare them to a recently proposed hierarchical Bayesian model for the analysis of gene expression time-series (BATS). We compare on both simulated and experimental data showing that the proposed approach considerably outperforms the current state of the art.</p> <p>Conclusions</p> <p>Gaussian processes offer an attractive trade-off between efficiency and usability for the analysis of microarray time series. The Gaussian process framework offers a natural way of handling biological replicates and missing values and provides confidence intervals along the estimated curves of gene expression. Therefore, we believe Gaussian processes should be a standard tool in the analysis of gene expression time series.</p> http://www.biomedcentral.com/1471-2105/12/180
collection DOAJ
language English
format Article
sources DOAJ
author Lawrence Neil D
Kalaitzis Alfredo A
spellingShingle Lawrence Neil D
Kalaitzis Alfredo A
A Simple Approach to Ranking Differentially Expressed Gene Expression Time Courses through Gaussian Process Regression
BMC Bioinformatics
author_facet Lawrence Neil D
Kalaitzis Alfredo A
author_sort Lawrence Neil D
title A Simple Approach to Ranking Differentially Expressed Gene Expression Time Courses through Gaussian Process Regression
title_short A Simple Approach to Ranking Differentially Expressed Gene Expression Time Courses through Gaussian Process Regression
title_full A Simple Approach to Ranking Differentially Expressed Gene Expression Time Courses through Gaussian Process Regression
title_fullStr A Simple Approach to Ranking Differentially Expressed Gene Expression Time Courses through Gaussian Process Regression
title_full_unstemmed A Simple Approach to Ranking Differentially Expressed Gene Expression Time Courses through Gaussian Process Regression
title_sort simple approach to ranking differentially expressed gene expression time courses through gaussian process regression
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2011-05-01
description <p>Abstract</p> <p>Background</p> <p>The analysis of gene expression from time series underpins many biological studies. Two basic forms of analysis recur for data of this type: removing inactive (quiet) genes from the study and determining which genes are differentially expressed. Often these analysis stages are applied disregarding the fact that the data is drawn from a time series. In this paper we propose a simple model for accounting for the underlying temporal nature of the data based on a Gaussian process.</p> <p>Results</p> <p>We review Gaussian process (GP) regression for estimating the continuous trajectories underlying in gene expression time-series. We present a simple approach which can be used to filter quiet genes, or for the case of time series in the form of expression ratios, quantify differential expression. We assess via ROC curves the rankings produced by our regression framework and compare them to a recently proposed hierarchical Bayesian model for the analysis of gene expression time-series (BATS). We compare on both simulated and experimental data showing that the proposed approach considerably outperforms the current state of the art.</p> <p>Conclusions</p> <p>Gaussian processes offer an attractive trade-off between efficiency and usability for the analysis of microarray time series. The Gaussian process framework offers a natural way of handling biological replicates and missing values and provides confidence intervals along the estimated curves of gene expression. Therefore, we believe Gaussian processes should be a standard tool in the analysis of gene expression time series.</p>
url http://www.biomedcentral.com/1471-2105/12/180
work_keys_str_mv AT lawrenceneild asimpleapproachtorankingdifferentiallyexpressedgeneexpressiontimecoursesthroughgaussianprocessregression
AT kalaitzisalfredoa asimpleapproachtorankingdifferentiallyexpressedgeneexpressiontimecoursesthroughgaussianprocessregression
AT lawrenceneild simpleapproachtorankingdifferentiallyexpressedgeneexpressiontimecoursesthroughgaussianprocessregression
AT kalaitzisalfredoa simpleapproachtorankingdifferentiallyexpressedgeneexpressiontimecoursesthroughgaussianprocessregression
_version_ 1726017192072314880