A Feature Selection Algorithm Performance Metric for Comparative Analysis

This study presents a novel performance metric for feature selection algorithms that is unbiased and can be used for comparative analysis across feature selection problems. The baseline fitness improvement (BFI) measure quantifies the potential value gained by applying feature selection. The BFI mea...

Full description

Bibliographic Details
Main Authors: Werner Mostert, Katherine M. Malan, Andries P. Engelbrecht
Format: Article
Language:English
Published: MDPI AG 2021-03-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/14/3/100
id doaj-16c69f57b7ad4e5caee007dfa6f7e945
record_format Article
spelling doaj-16c69f57b7ad4e5caee007dfa6f7e9452021-03-23T00:06:28ZengMDPI AGAlgorithms1999-48932021-03-011410010010.3390/a14030100A Feature Selection Algorithm Performance Metric for Comparative AnalysisWerner Mostert0Katherine M. Malan1Andries P. Engelbrecht2Department of Industrial Engineering, Division of Computer Science, Stellenbosch University, Stellenbosch 7600, South AfricaDepartment of Decision Sciences, University of South Africa, Pretoria 7701, South AfricaDepartment of Industrial Engineering, Division of Computer Science, Stellenbosch University, Stellenbosch 7600, South AfricaThis study presents a novel performance metric for feature selection algorithms that is unbiased and can be used for comparative analysis across feature selection problems. The baseline fitness improvement (BFI) measure quantifies the potential value gained by applying feature selection. The BFI measure can be used to compare the performance of feature selection algorithms across datasets by measuring the change in classifier performance as a result of feature selection, with respect to the baseline where all features are included. Empirical results are presented to show that there is performance complementarity for a suite of feature selection algorithms on a variety of real world datasets. The BFI measure is a normalised performance metric that can be used to correlate problem characteristics with feature selection algorithm performance, across multiple datasets. This ability paves the way towards describing the performance space of the per-instance algorithm selection problem for feature selection algorithms.https://www.mdpi.com/1999-4893/14/3/100feature selectionbaseline fitness improvementperformance analysis
collection DOAJ
language English
format Article
sources DOAJ
author Werner Mostert
Katherine M. Malan
Andries P. Engelbrecht
spellingShingle Werner Mostert
Katherine M. Malan
Andries P. Engelbrecht
A Feature Selection Algorithm Performance Metric for Comparative Analysis
Algorithms
feature selection
baseline fitness improvement
performance analysis
author_facet Werner Mostert
Katherine M. Malan
Andries P. Engelbrecht
author_sort Werner Mostert
title A Feature Selection Algorithm Performance Metric for Comparative Analysis
title_short A Feature Selection Algorithm Performance Metric for Comparative Analysis
title_full A Feature Selection Algorithm Performance Metric for Comparative Analysis
title_fullStr A Feature Selection Algorithm Performance Metric for Comparative Analysis
title_full_unstemmed A Feature Selection Algorithm Performance Metric for Comparative Analysis
title_sort feature selection algorithm performance metric for comparative analysis
publisher MDPI AG
series Algorithms
issn 1999-4893
publishDate 2021-03-01
description This study presents a novel performance metric for feature selection algorithms that is unbiased and can be used for comparative analysis across feature selection problems. The baseline fitness improvement (BFI) measure quantifies the potential value gained by applying feature selection. The BFI measure can be used to compare the performance of feature selection algorithms across datasets by measuring the change in classifier performance as a result of feature selection, with respect to the baseline where all features are included. Empirical results are presented to show that there is performance complementarity for a suite of feature selection algorithms on a variety of real world datasets. The BFI measure is a normalised performance metric that can be used to correlate problem characteristics with feature selection algorithm performance, across multiple datasets. This ability paves the way towards describing the performance space of the per-instance algorithm selection problem for feature selection algorithms.
topic feature selection
baseline fitness improvement
performance analysis
url https://www.mdpi.com/1999-4893/14/3/100
work_keys_str_mv AT wernermostert afeatureselectionalgorithmperformancemetricforcomparativeanalysis
AT katherinemmalan afeatureselectionalgorithmperformancemetricforcomparativeanalysis
AT andriespengelbrecht afeatureselectionalgorithmperformancemetricforcomparativeanalysis
AT wernermostert featureselectionalgorithmperformancemetricforcomparativeanalysis
AT katherinemmalan featureselectionalgorithmperformancemetricforcomparativeanalysis
AT andriespengelbrecht featureselectionalgorithmperformancemetricforcomparativeanalysis
_version_ 1724206877391192064