Ensemble Based Filter Feature Selection with Harmonize Particle Swarm Optimization and Support Vector Machine for Optimal Cancer Classification

Explosive increase of dataset features may intensify the complexity of medical data analysis in deciding necessary treatment for the patient. In most cases, the accuracy of diagnosis system is vitally impacted by the data dimensionality and classifier parameters. Since these two processes are depend...

Full description

Bibliographic Details
Main Authors: Tengku Mazlin Tengku Ab Hamid, Roselina Sallehuddin, Zuriahati Mohd Yunos, Aida Ali
Format: Article
Language:English
Published: Elsevier 2021-09-01
Series:Machine Learning with Applications
Online Access:http://www.sciencedirect.com/science/article/pii/S2666827021000281
id doaj-a155993eb2cf4f2baf1c5c717510ac19
record_format Article
spelling doaj-a155993eb2cf4f2baf1c5c717510ac192021-08-20T04:37:08ZengElsevierMachine Learning with Applications2666-82702021-09-015100054Ensemble Based Filter Feature Selection with Harmonize Particle Swarm Optimization and Support Vector Machine for Optimal Cancer ClassificationTengku Mazlin Tengku Ab Hamid0Roselina Sallehuddin1Zuriahati Mohd Yunos2Aida Ali3Corresponding author.; Faculty of Engineering, School of Computing, Universiti Teknologi Malaysia, Skudai, Johor, MalaysiaFaculty of Engineering, School of Computing, Universiti Teknologi Malaysia, Skudai, Johor, MalaysiaFaculty of Engineering, School of Computing, Universiti Teknologi Malaysia, Skudai, Johor, MalaysiaFaculty of Engineering, School of Computing, Universiti Teknologi Malaysia, Skudai, Johor, MalaysiaExplosive increase of dataset features may intensify the complexity of medical data analysis in deciding necessary treatment for the patient. In most cases, the accuracy of diagnosis system is vitally impacted by the data dimensionality and classifier parameters. Since these two processes are dependent, conducting them independently could deteriorate the accuracy performance. Filter algorithm is used to eliminate irrelevant features based on ranking. However, independent filter still incapable to consider features dependency and resulting in imbalance selection of significant features which consequently degrade the classification performance. In order to mitigate this problem, ensemble of multi filters algorithm such as Information Gain (IG), Gain Ratio (GR), Chi-squared (CS) and Relief-F (RF) are utilized as it can considers the intercorrelation between features. The proper kernel parameters settings may also influence the classification performance. Hence, a harmonize classification technique using Particle Swarm Optimization (PSO) and Support Vector Machine (SVM) is employed to optimize the searching of optimal significant features and kernel parameters synchronously without degrading the accuracy. Therefore, an ensemble filter feature selection with harmonize classification of PSO and SVM (Ensemble-PSO-SVM) are proposed in this research. The effectiveness of the proposed method is examined on standard Breast Cancer and Lymphography datasets. Experimental results showed that the proposed method successfully signify the classifier accuracy performance with optimal significant features compared to other existing methods such as PSO-SVM and classical SVM. Hence, the proposed method can be used as an alternative method for determining the optimal solution in handling high dimensional data.http://www.sciencedirect.com/science/article/pii/S2666827021000281
collection DOAJ
language English
format Article
sources DOAJ
author Tengku Mazlin Tengku Ab Hamid
Roselina Sallehuddin
Zuriahati Mohd Yunos
Aida Ali
spellingShingle Tengku Mazlin Tengku Ab Hamid
Roselina Sallehuddin
Zuriahati Mohd Yunos
Aida Ali
Ensemble Based Filter Feature Selection with Harmonize Particle Swarm Optimization and Support Vector Machine for Optimal Cancer Classification
Machine Learning with Applications
author_facet Tengku Mazlin Tengku Ab Hamid
Roselina Sallehuddin
Zuriahati Mohd Yunos
Aida Ali
author_sort Tengku Mazlin Tengku Ab Hamid
title Ensemble Based Filter Feature Selection with Harmonize Particle Swarm Optimization and Support Vector Machine for Optimal Cancer Classification
title_short Ensemble Based Filter Feature Selection with Harmonize Particle Swarm Optimization and Support Vector Machine for Optimal Cancer Classification
title_full Ensemble Based Filter Feature Selection with Harmonize Particle Swarm Optimization and Support Vector Machine for Optimal Cancer Classification
title_fullStr Ensemble Based Filter Feature Selection with Harmonize Particle Swarm Optimization and Support Vector Machine for Optimal Cancer Classification
title_full_unstemmed Ensemble Based Filter Feature Selection with Harmonize Particle Swarm Optimization and Support Vector Machine for Optimal Cancer Classification
title_sort ensemble based filter feature selection with harmonize particle swarm optimization and support vector machine for optimal cancer classification
publisher Elsevier
series Machine Learning with Applications
issn 2666-8270
publishDate 2021-09-01
description Explosive increase of dataset features may intensify the complexity of medical data analysis in deciding necessary treatment for the patient. In most cases, the accuracy of diagnosis system is vitally impacted by the data dimensionality and classifier parameters. Since these two processes are dependent, conducting them independently could deteriorate the accuracy performance. Filter algorithm is used to eliminate irrelevant features based on ranking. However, independent filter still incapable to consider features dependency and resulting in imbalance selection of significant features which consequently degrade the classification performance. In order to mitigate this problem, ensemble of multi filters algorithm such as Information Gain (IG), Gain Ratio (GR), Chi-squared (CS) and Relief-F (RF) are utilized as it can considers the intercorrelation between features. The proper kernel parameters settings may also influence the classification performance. Hence, a harmonize classification technique using Particle Swarm Optimization (PSO) and Support Vector Machine (SVM) is employed to optimize the searching of optimal significant features and kernel parameters synchronously without degrading the accuracy. Therefore, an ensemble filter feature selection with harmonize classification of PSO and SVM (Ensemble-PSO-SVM) are proposed in this research. The effectiveness of the proposed method is examined on standard Breast Cancer and Lymphography datasets. Experimental results showed that the proposed method successfully signify the classifier accuracy performance with optimal significant features compared to other existing methods such as PSO-SVM and classical SVM. Hence, the proposed method can be used as an alternative method for determining the optimal solution in handling high dimensional data.
url http://www.sciencedirect.com/science/article/pii/S2666827021000281
work_keys_str_mv AT tengkumazlintengkuabhamid ensemblebasedfilterfeatureselectionwithharmonizeparticleswarmoptimizationandsupportvectormachineforoptimalcancerclassification
AT roselinasallehuddin ensemblebasedfilterfeatureselectionwithharmonizeparticleswarmoptimizationandsupportvectormachineforoptimalcancerclassification
AT zuriahatimohdyunos ensemblebasedfilterfeatureselectionwithharmonizeparticleswarmoptimizationandsupportvectormachineforoptimalcancerclassification
AT aidaali ensemblebasedfilterfeatureselectionwithharmonizeparticleswarmoptimizationandsupportvectormachineforoptimalcancerclassification
_version_ 1721201626413268992