An Efficient Ensemble Learning Method for Gene Microarray Classification

The gene microarray analysis and classification have demonstrated an effective way for the effective diagnosis of diseases and cancers. However, it has been also revealed that the basic classification techniques have intrinsic drawbacks in achieving accurate gene classification and cancer diagnosis....

Full description

Bibliographic Details
Main Authors: Alireza Osareh, Bita Shadgar
Format: Article
Language:English
Published: Hindawi Limited 2013-01-01
Series:BioMed Research International
Online Access:http://dx.doi.org/10.1155/2013/478410
id doaj-3556185dfb4b4e7b80da08f6558bb800
record_format Article
spelling doaj-3556185dfb4b4e7b80da08f6558bb8002020-11-24T23:46:42ZengHindawi LimitedBioMed Research International2314-61332314-61412013-01-01201310.1155/2013/478410478410An Efficient Ensemble Learning Method for Gene Microarray ClassificationAlireza Osareh0Bita Shadgar1Department of Computer Engineering, Islamic Azad University, Dezful Branch, Dezful 313, IranDepartment of Computer Engineering, Islamic Azad University, Dezful Branch, Dezful 313, IranThe gene microarray analysis and classification have demonstrated an effective way for the effective diagnosis of diseases and cancers. However, it has been also revealed that the basic classification techniques have intrinsic drawbacks in achieving accurate gene classification and cancer diagnosis. On the other hand, classifier ensembles have received increasing attention in various applications. Here, we address the gene classification issue using RotBoost ensemble methodology. This method is a combination of Rotation Forest and AdaBoost techniques which in turn preserve both desirable features of an ensemble architecture, that is, accuracy and diversity. To select a concise subset of informative genes, 5 different feature selection algorithms are considered. To assess the efficiency of the RotBoost, other nonensemble/ensemble techniques including Decision Trees, Support Vector Machines, Rotation Forest, AdaBoost, and Bagging are also deployed. Experimental results have revealed that the combination of the fast correlation-based feature selection method with ICA-based RotBoost ensemble is highly effective for gene classification. In fact, the proposed method can create ensemble classifiers which outperform not only the classifiers produced by the conventional machine learning but also the classifiers generated by two widely used conventional ensemble learning methods, that is, Bagging and AdaBoost.http://dx.doi.org/10.1155/2013/478410
collection DOAJ
language English
format Article
sources DOAJ
author Alireza Osareh
Bita Shadgar
spellingShingle Alireza Osareh
Bita Shadgar
An Efficient Ensemble Learning Method for Gene Microarray Classification
BioMed Research International
author_facet Alireza Osareh
Bita Shadgar
author_sort Alireza Osareh
title An Efficient Ensemble Learning Method for Gene Microarray Classification
title_short An Efficient Ensemble Learning Method for Gene Microarray Classification
title_full An Efficient Ensemble Learning Method for Gene Microarray Classification
title_fullStr An Efficient Ensemble Learning Method for Gene Microarray Classification
title_full_unstemmed An Efficient Ensemble Learning Method for Gene Microarray Classification
title_sort efficient ensemble learning method for gene microarray classification
publisher Hindawi Limited
series BioMed Research International
issn 2314-6133
2314-6141
publishDate 2013-01-01
description The gene microarray analysis and classification have demonstrated an effective way for the effective diagnosis of diseases and cancers. However, it has been also revealed that the basic classification techniques have intrinsic drawbacks in achieving accurate gene classification and cancer diagnosis. On the other hand, classifier ensembles have received increasing attention in various applications. Here, we address the gene classification issue using RotBoost ensemble methodology. This method is a combination of Rotation Forest and AdaBoost techniques which in turn preserve both desirable features of an ensemble architecture, that is, accuracy and diversity. To select a concise subset of informative genes, 5 different feature selection algorithms are considered. To assess the efficiency of the RotBoost, other nonensemble/ensemble techniques including Decision Trees, Support Vector Machines, Rotation Forest, AdaBoost, and Bagging are also deployed. Experimental results have revealed that the combination of the fast correlation-based feature selection method with ICA-based RotBoost ensemble is highly effective for gene classification. In fact, the proposed method can create ensemble classifiers which outperform not only the classifiers produced by the conventional machine learning but also the classifiers generated by two widely used conventional ensemble learning methods, that is, Bagging and AdaBoost.
url http://dx.doi.org/10.1155/2013/478410
work_keys_str_mv AT alirezaosareh anefficientensemblelearningmethodforgenemicroarrayclassification
AT bitashadgar anefficientensemblelearningmethodforgenemicroarrayclassification
AT alirezaosareh efficientensemblelearningmethodforgenemicroarrayclassification
AT bitashadgar efficientensemblelearningmethodforgenemicroarrayclassification
_version_ 1725492785773019136