An adaptive optimal ensemble classifier via bagging and rank aggregation with applications to high dimensional data
<p>Abstract</p> <p>Background</p> <p>Generally speaking, different classifiers tend to work well for certain types of data and conversely, it is usually not known a priori which algorithm will be optimal in any given classification application. In addition, for most cla...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2010-08-01
|
Series: | BMC Bioinformatics |
Online Access: | http://www.biomedcentral.com/1471-2105/11/427 |
id |
doaj-283dea6687c54c84b31eb9dc44767ffc |
---|---|
record_format |
Article |
spelling |
doaj-283dea6687c54c84b31eb9dc44767ffc2020-11-25T00:50:09ZengBMCBMC Bioinformatics1471-21052010-08-0111142710.1186/1471-2105-11-427An adaptive optimal ensemble classifier via bagging and rank aggregation with applications to high dimensional dataDatta SusmitaPihur VasylDatta Somnath<p>Abstract</p> <p>Background</p> <p>Generally speaking, different classifiers tend to work well for certain types of data and conversely, it is usually not known a priori which algorithm will be optimal in any given classification application. In addition, for most classification problems, selecting the best performing classification algorithm amongst a number of competing algorithms is a difficult task for various reasons. As for example, the order of performance may depend on the performance measure employed for such a comparison. In this work, we present a novel adaptive ensemble classifier constructed by combining bagging and rank aggregation that is capable of adaptively changing its performance depending on the type of data that is being classified. The attractive feature of the proposed classifier is its multi-objective nature where the classification results can be simultaneously optimized with respect to several performance measures, for example, accuracy, sensitivity and specificity. We also show that our somewhat complex strategy has better predictive performance as judged on test samples than a more naive approach that attempts to directly identify the optimal classifier based on the training data performances of the individual classifiers.</p> <p>Results</p> <p>We illustrate the proposed method with two simulated and two real-data examples. In all cases, the ensemble classifier performs at the level of the best individual classifier comprising the ensemble or better.</p> <p>Conclusions</p> <p>For complex high-dimensional datasets resulting from present day high-throughput experiments, it may be wise to consider a number of classification algorithms combined with dimension reduction techniques rather than a fixed standard algorithm set a priori.</p> http://www.biomedcentral.com/1471-2105/11/427 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Datta Susmita Pihur Vasyl Datta Somnath |
spellingShingle |
Datta Susmita Pihur Vasyl Datta Somnath An adaptive optimal ensemble classifier via bagging and rank aggregation with applications to high dimensional data BMC Bioinformatics |
author_facet |
Datta Susmita Pihur Vasyl Datta Somnath |
author_sort |
Datta Susmita |
title |
An adaptive optimal ensemble classifier via bagging and rank aggregation with applications to high dimensional data |
title_short |
An adaptive optimal ensemble classifier via bagging and rank aggregation with applications to high dimensional data |
title_full |
An adaptive optimal ensemble classifier via bagging and rank aggregation with applications to high dimensional data |
title_fullStr |
An adaptive optimal ensemble classifier via bagging and rank aggregation with applications to high dimensional data |
title_full_unstemmed |
An adaptive optimal ensemble classifier via bagging and rank aggregation with applications to high dimensional data |
title_sort |
adaptive optimal ensemble classifier via bagging and rank aggregation with applications to high dimensional data |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2010-08-01 |
description |
<p>Abstract</p> <p>Background</p> <p>Generally speaking, different classifiers tend to work well for certain types of data and conversely, it is usually not known a priori which algorithm will be optimal in any given classification application. In addition, for most classification problems, selecting the best performing classification algorithm amongst a number of competing algorithms is a difficult task for various reasons. As for example, the order of performance may depend on the performance measure employed for such a comparison. In this work, we present a novel adaptive ensemble classifier constructed by combining bagging and rank aggregation that is capable of adaptively changing its performance depending on the type of data that is being classified. The attractive feature of the proposed classifier is its multi-objective nature where the classification results can be simultaneously optimized with respect to several performance measures, for example, accuracy, sensitivity and specificity. We also show that our somewhat complex strategy has better predictive performance as judged on test samples than a more naive approach that attempts to directly identify the optimal classifier based on the training data performances of the individual classifiers.</p> <p>Results</p> <p>We illustrate the proposed method with two simulated and two real-data examples. In all cases, the ensemble classifier performs at the level of the best individual classifier comprising the ensemble or better.</p> <p>Conclusions</p> <p>For complex high-dimensional datasets resulting from present day high-throughput experiments, it may be wise to consider a number of classification algorithms combined with dimension reduction techniques rather than a fixed standard algorithm set a priori.</p> |
url |
http://www.biomedcentral.com/1471-2105/11/427 |
work_keys_str_mv |
AT dattasusmita anadaptiveoptimalensembleclassifierviabaggingandrankaggregationwithapplicationstohighdimensionaldata AT pihurvasyl anadaptiveoptimalensembleclassifierviabaggingandrankaggregationwithapplicationstohighdimensionaldata AT dattasomnath anadaptiveoptimalensembleclassifierviabaggingandrankaggregationwithapplicationstohighdimensionaldata AT dattasusmita adaptiveoptimalensembleclassifierviabaggingandrankaggregationwithapplicationstohighdimensionaldata AT pihurvasyl adaptiveoptimalensembleclassifierviabaggingandrankaggregationwithapplicationstohighdimensionaldata AT dattasomnath adaptiveoptimalensembleclassifierviabaggingandrankaggregationwithapplicationstohighdimensionaldata |
_version_ |
1725249060721393664 |