A Feature Selection Algorithm Performance Metric for Comparative Analysis
This study presents a novel performance metric for feature selection algorithms that is unbiased and can be used for comparative analysis across feature selection problems. The baseline fitness improvement (BFI) measure quantifies the potential value gained by applying feature selection. The BFI mea...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-03-01
|
Series: | Algorithms |
Subjects: | |
Online Access: | https://www.mdpi.com/1999-4893/14/3/100 |
id |
doaj-16c69f57b7ad4e5caee007dfa6f7e945 |
---|---|
record_format |
Article |
spelling |
doaj-16c69f57b7ad4e5caee007dfa6f7e9452021-03-23T00:06:28ZengMDPI AGAlgorithms1999-48932021-03-011410010010.3390/a14030100A Feature Selection Algorithm Performance Metric for Comparative AnalysisWerner Mostert0Katherine M. Malan1Andries P. Engelbrecht2Department of Industrial Engineering, Division of Computer Science, Stellenbosch University, Stellenbosch 7600, South AfricaDepartment of Decision Sciences, University of South Africa, Pretoria 7701, South AfricaDepartment of Industrial Engineering, Division of Computer Science, Stellenbosch University, Stellenbosch 7600, South AfricaThis study presents a novel performance metric for feature selection algorithms that is unbiased and can be used for comparative analysis across feature selection problems. The baseline fitness improvement (BFI) measure quantifies the potential value gained by applying feature selection. The BFI measure can be used to compare the performance of feature selection algorithms across datasets by measuring the change in classifier performance as a result of feature selection, with respect to the baseline where all features are included. Empirical results are presented to show that there is performance complementarity for a suite of feature selection algorithms on a variety of real world datasets. The BFI measure is a normalised performance metric that can be used to correlate problem characteristics with feature selection algorithm performance, across multiple datasets. This ability paves the way towards describing the performance space of the per-instance algorithm selection problem for feature selection algorithms.https://www.mdpi.com/1999-4893/14/3/100feature selectionbaseline fitness improvementperformance analysis |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Werner Mostert Katherine M. Malan Andries P. Engelbrecht |
spellingShingle |
Werner Mostert Katherine M. Malan Andries P. Engelbrecht A Feature Selection Algorithm Performance Metric for Comparative Analysis Algorithms feature selection baseline fitness improvement performance analysis |
author_facet |
Werner Mostert Katherine M. Malan Andries P. Engelbrecht |
author_sort |
Werner Mostert |
title |
A Feature Selection Algorithm Performance Metric for Comparative Analysis |
title_short |
A Feature Selection Algorithm Performance Metric for Comparative Analysis |
title_full |
A Feature Selection Algorithm Performance Metric for Comparative Analysis |
title_fullStr |
A Feature Selection Algorithm Performance Metric for Comparative Analysis |
title_full_unstemmed |
A Feature Selection Algorithm Performance Metric for Comparative Analysis |
title_sort |
feature selection algorithm performance metric for comparative analysis |
publisher |
MDPI AG |
series |
Algorithms |
issn |
1999-4893 |
publishDate |
2021-03-01 |
description |
This study presents a novel performance metric for feature selection algorithms that is unbiased and can be used for comparative analysis across feature selection problems. The baseline fitness improvement (BFI) measure quantifies the potential value gained by applying feature selection. The BFI measure can be used to compare the performance of feature selection algorithms across datasets by measuring the change in classifier performance as a result of feature selection, with respect to the baseline where all features are included. Empirical results are presented to show that there is performance complementarity for a suite of feature selection algorithms on a variety of real world datasets. The BFI measure is a normalised performance metric that can be used to correlate problem characteristics with feature selection algorithm performance, across multiple datasets. This ability paves the way towards describing the performance space of the per-instance algorithm selection problem for feature selection algorithms. |
topic |
feature selection baseline fitness improvement performance analysis |
url |
https://www.mdpi.com/1999-4893/14/3/100 |
work_keys_str_mv |
AT wernermostert afeatureselectionalgorithmperformancemetricforcomparativeanalysis AT katherinemmalan afeatureselectionalgorithmperformancemetricforcomparativeanalysis AT andriespengelbrecht afeatureselectionalgorithmperformancemetricforcomparativeanalysis AT wernermostert featureselectionalgorithmperformancemetricforcomparativeanalysis AT katherinemmalan featureselectionalgorithmperformancemetricforcomparativeanalysis AT andriespengelbrecht featureselectionalgorithmperformancemetricforcomparativeanalysis |
_version_ |
1724206877391192064 |