Refining gene signatures: a Bayesian approach

Abstract Background In high density arrays, the identification of relevant genes for disease classification is complicated by not only the curse of dimensionality but also the highly correlated nature of the array data. In this paper, we are interested...

Full description

Bibliographic Details
Main Authors:	Labbe Aurélie, Djebbari Amira
Format:	Article
Language:	English
Published:	BMC 2009-12-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/10/410

id	doaj-4f8e501d6cf548e889128ff909a36f72
record_format	Article
spelling	doaj-4f8e501d6cf548e889128ff909a36f722020-11-25T01:26:47ZengBMCBMC Bioinformatics1471-21052009-12-0110141010.1186/1471-2105-10-410Refining gene signatures: a Bayesian approachLabbe AurélieDjebbari Amira<p>Abstract</p> <p>Background</p> <p>In high density arrays, the identification of relevant genes for disease classification is complicated by not only the curse of dimensionality but also the highly correlated nature of the array data. In this paper, we are interested in the question of how many and which genes should be selected for a disease class prediction. Our work consists of a Bayesian supervised statistical learning approach to refine gene signatures with a regularization which penalizes for the correlation between the variables selected.</p> <p>Results</p> <p>Our simulation results show that we can most often recover the correct subset of genes that predict the class as compared to other methods, even when accuracy and subset size remain the same. On real microarray datasets, we show that our approach can refine gene signatures to obtain either the same or better predictive performance than other existing methods with a smaller number of genes.</p> <p>Conclusions</p> <p>Our novel Bayesian approach includes a prior which penalizes highly correlated features in model selection and is able to extract key genes in the highly correlated context of microarray data. The methodology in the paper is described in the context of microarray data, but can be applied to any array data (such as micro RNA, for example) as a first step towards predictive modeling of cancer pathways. A user-friendly software implementation of the method is available.</p> http://www.biomedcentral.com/1471-2105/10/410
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Labbe Aurélie Djebbari Amira
spellingShingle	Labbe Aurélie Djebbari Amira Refining gene signatures: a Bayesian approach BMC Bioinformatics
author_facet	Labbe Aurélie Djebbari Amira
author_sort	Labbe Aurélie
title	Refining gene signatures: a Bayesian approach
title_short	Refining gene signatures: a Bayesian approach
title_full	Refining gene signatures: a Bayesian approach
title_fullStr	Refining gene signatures: a Bayesian approach
title_full_unstemmed	Refining gene signatures: a Bayesian approach
title_sort	refining gene signatures: a bayesian approach
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2009-12-01
description	<p>Abstract</p> <p>Background</p> <p>In high density arrays, the identification of relevant genes for disease classification is complicated by not only the curse of dimensionality but also the highly correlated nature of the array data. In this paper, we are interested in the question of how many and which genes should be selected for a disease class prediction. Our work consists of a Bayesian supervised statistical learning approach to refine gene signatures with a regularization which penalizes for the correlation between the variables selected.</p> <p>Results</p> <p>Our simulation results show that we can most often recover the correct subset of genes that predict the class as compared to other methods, even when accuracy and subset size remain the same. On real microarray datasets, we show that our approach can refine gene signatures to obtain either the same or better predictive performance than other existing methods with a smaller number of genes.</p> <p>Conclusions</p> <p>Our novel Bayesian approach includes a prior which penalizes highly correlated features in model selection and is able to extract key genes in the highly correlated context of microarray data. The methodology in the paper is described in the context of microarray data, but can be applied to any array data (such as micro RNA, for example) as a first step towards predictive modeling of cancer pathways. A user-friendly software implementation of the method is available.</p>
url	http://www.biomedcentral.com/1471-2105/10/410
work_keys_str_mv	AT labbeaurelie refininggenesignaturesabayesianapproach AT djebbariamira refininggenesignaturesabayesianapproach
_version_	1725109036697780224

Refining gene signatures: a Bayesian approach

Similar Items