Multiclass classification of microarray data samples with a reduced number of genes
<p>Abstract</p> <p>Background</p> <p>Multiclass classification of microarray data samples with a reduced number of genes is a rich and challenging problem in Bioinformatics research. The problem gets harder as the number of classes is increased. In addition, the perform...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2011-02-01
|
Series: | BMC Bioinformatics |
Online Access: | http://www.biomedcentral.com/1471-2105/12/59 |
id |
doaj-20a71581885149f7900f2f1963530530 |
---|---|
record_format |
Article |
spelling |
doaj-20a71581885149f7900f2f19635305302020-11-25T02:45:26ZengBMCBMC Bioinformatics1471-21052011-02-011215910.1186/1471-2105-12-59Multiclass classification of microarray data samples with a reduced number of genesOrnella LeonardoTapia ElizabethBulacio PilarAngelone Laura<p>Abstract</p> <p>Background</p> <p>Multiclass classification of microarray data samples with a reduced number of genes is a rich and challenging problem in Bioinformatics research. The problem gets harder as the number of classes is increased. In addition, the performance of most classifiers is tightly linked to the effectiveness of mandatory gene selection methods. Critical to gene selection is the availability of estimates about the maximum number of genes that can be handled by any classification algorithm. Lack of such estimates may lead to either computationally demanding explorations of a search space with thousands of dimensions or classification models based on gene sets of unrestricted size. In the former case, unbiased but possibly overfitted classification models may arise. In the latter case, biased classification models unable to support statistically significant findings may be obtained.</p> <p>Results</p> <p>A novel bound on the maximum number of genes that can be handled by binary classifiers in binary mediated multiclass classification algorithms of microarray data samples is presented. The bound suggests that high-dimensional binary output domains might favor the existence of accurate and sparse binary mediated multiclass classifiers for microarray data samples.</p> <p>Conclusions</p> <p>A comprehensive experimental work shows that the bound is indeed useful to induce accurate and sparse multiclass classifiers for microarray data samples.</p> http://www.biomedcentral.com/1471-2105/12/59 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ornella Leonardo Tapia Elizabeth Bulacio Pilar Angelone Laura |
spellingShingle |
Ornella Leonardo Tapia Elizabeth Bulacio Pilar Angelone Laura Multiclass classification of microarray data samples with a reduced number of genes BMC Bioinformatics |
author_facet |
Ornella Leonardo Tapia Elizabeth Bulacio Pilar Angelone Laura |
author_sort |
Ornella Leonardo |
title |
Multiclass classification of microarray data samples with a reduced number of genes |
title_short |
Multiclass classification of microarray data samples with a reduced number of genes |
title_full |
Multiclass classification of microarray data samples with a reduced number of genes |
title_fullStr |
Multiclass classification of microarray data samples with a reduced number of genes |
title_full_unstemmed |
Multiclass classification of microarray data samples with a reduced number of genes |
title_sort |
multiclass classification of microarray data samples with a reduced number of genes |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2011-02-01 |
description |
<p>Abstract</p> <p>Background</p> <p>Multiclass classification of microarray data samples with a reduced number of genes is a rich and challenging problem in Bioinformatics research. The problem gets harder as the number of classes is increased. In addition, the performance of most classifiers is tightly linked to the effectiveness of mandatory gene selection methods. Critical to gene selection is the availability of estimates about the maximum number of genes that can be handled by any classification algorithm. Lack of such estimates may lead to either computationally demanding explorations of a search space with thousands of dimensions or classification models based on gene sets of unrestricted size. In the former case, unbiased but possibly overfitted classification models may arise. In the latter case, biased classification models unable to support statistically significant findings may be obtained.</p> <p>Results</p> <p>A novel bound on the maximum number of genes that can be handled by binary classifiers in binary mediated multiclass classification algorithms of microarray data samples is presented. The bound suggests that high-dimensional binary output domains might favor the existence of accurate and sparse binary mediated multiclass classifiers for microarray data samples.</p> <p>Conclusions</p> <p>A comprehensive experimental work shows that the bound is indeed useful to induce accurate and sparse multiclass classifiers for microarray data samples.</p> |
url |
http://www.biomedcentral.com/1471-2105/12/59 |
work_keys_str_mv |
AT ornellaleonardo multiclassclassificationofmicroarraydatasampleswithareducednumberofgenes AT tapiaelizabeth multiclassclassificationofmicroarraydatasampleswithareducednumberofgenes AT bulaciopilar multiclassclassificationofmicroarraydatasampleswithareducednumberofgenes AT angelonelaura multiclassclassificationofmicroarraydatasampleswithareducednumberofgenes |
_version_ |
1724762965599584256 |