Multiclass classification of microarray data samples with a reduced number of genes

<p>Abstract</p> <p>Background</p> <p>Multiclass classification of microarray data samples with a reduced number of genes is a rich and challenging problem in Bioinformatics research. The problem gets harder as the number of classes is increased. In addition, the perform...

Full description

Bibliographic Details
Main Authors: Ornella Leonardo, Tapia Elizabeth, Bulacio Pilar, Angelone Laura
Format: Article
Language:English
Published: BMC 2011-02-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/12/59
id doaj-20a71581885149f7900f2f1963530530
record_format Article
spelling doaj-20a71581885149f7900f2f19635305302020-11-25T02:45:26ZengBMCBMC Bioinformatics1471-21052011-02-011215910.1186/1471-2105-12-59Multiclass classification of microarray data samples with a reduced number of genesOrnella LeonardoTapia ElizabethBulacio PilarAngelone Laura<p>Abstract</p> <p>Background</p> <p>Multiclass classification of microarray data samples with a reduced number of genes is a rich and challenging problem in Bioinformatics research. The problem gets harder as the number of classes is increased. In addition, the performance of most classifiers is tightly linked to the effectiveness of mandatory gene selection methods. Critical to gene selection is the availability of estimates about the maximum number of genes that can be handled by any classification algorithm. Lack of such estimates may lead to either computationally demanding explorations of a search space with thousands of dimensions or classification models based on gene sets of unrestricted size. In the former case, unbiased but possibly overfitted classification models may arise. In the latter case, biased classification models unable to support statistically significant findings may be obtained.</p> <p>Results</p> <p>A novel bound on the maximum number of genes that can be handled by binary classifiers in binary mediated multiclass classification algorithms of microarray data samples is presented. The bound suggests that high-dimensional binary output domains might favor the existence of accurate and sparse binary mediated multiclass classifiers for microarray data samples.</p> <p>Conclusions</p> <p>A comprehensive experimental work shows that the bound is indeed useful to induce accurate and sparse multiclass classifiers for microarray data samples.</p> http://www.biomedcentral.com/1471-2105/12/59
collection DOAJ
language English
format Article
sources DOAJ
author Ornella Leonardo
Tapia Elizabeth
Bulacio Pilar
Angelone Laura
spellingShingle Ornella Leonardo
Tapia Elizabeth
Bulacio Pilar
Angelone Laura
Multiclass classification of microarray data samples with a reduced number of genes
BMC Bioinformatics
author_facet Ornella Leonardo
Tapia Elizabeth
Bulacio Pilar
Angelone Laura
author_sort Ornella Leonardo
title Multiclass classification of microarray data samples with a reduced number of genes
title_short Multiclass classification of microarray data samples with a reduced number of genes
title_full Multiclass classification of microarray data samples with a reduced number of genes
title_fullStr Multiclass classification of microarray data samples with a reduced number of genes
title_full_unstemmed Multiclass classification of microarray data samples with a reduced number of genes
title_sort multiclass classification of microarray data samples with a reduced number of genes
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2011-02-01
description <p>Abstract</p> <p>Background</p> <p>Multiclass classification of microarray data samples with a reduced number of genes is a rich and challenging problem in Bioinformatics research. The problem gets harder as the number of classes is increased. In addition, the performance of most classifiers is tightly linked to the effectiveness of mandatory gene selection methods. Critical to gene selection is the availability of estimates about the maximum number of genes that can be handled by any classification algorithm. Lack of such estimates may lead to either computationally demanding explorations of a search space with thousands of dimensions or classification models based on gene sets of unrestricted size. In the former case, unbiased but possibly overfitted classification models may arise. In the latter case, biased classification models unable to support statistically significant findings may be obtained.</p> <p>Results</p> <p>A novel bound on the maximum number of genes that can be handled by binary classifiers in binary mediated multiclass classification algorithms of microarray data samples is presented. The bound suggests that high-dimensional binary output domains might favor the existence of accurate and sparse binary mediated multiclass classifiers for microarray data samples.</p> <p>Conclusions</p> <p>A comprehensive experimental work shows that the bound is indeed useful to induce accurate and sparse multiclass classifiers for microarray data samples.</p>
url http://www.biomedcentral.com/1471-2105/12/59
work_keys_str_mv AT ornellaleonardo multiclassclassificationofmicroarraydatasampleswithareducednumberofgenes
AT tapiaelizabeth multiclassclassificationofmicroarraydatasampleswithareducednumberofgenes
AT bulaciopilar multiclassclassificationofmicroarraydatasampleswithareducednumberofgenes
AT angelonelaura multiclassclassificationofmicroarraydatasampleswithareducednumberofgenes
_version_ 1724762965599584256