Ordering Classifier Chains using filter model feature selection techniques
Context: Multi-label classification concerns classification with multi-dimensional output. The Classifier Chain breaks the multi-label problem into multiple binary classification problems, chaining the classifiers to exploit dependencies between labels. Consequently, its performance is influenced by...
Main Author: | |
---|---|
Format: | Others |
Language: | English |
Published: |
Blekinge Tekniska Högskola, Institutionen för datalogi och datorsystemteknik
2017
|
Subjects: | |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:se:bth-14817 |
id |
ndltd-UPSALLA1-oai-DiVA.org-bth-14817 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UPSALLA1-oai-DiVA.org-bth-148172018-01-14T05:11:32ZOrdering Classifier Chains using filter model feature selection techniquesengGustafsson, RobinBlekinge Tekniska Högskola, Institutionen för datalogi och datorsystemteknik2017multi-label classificationclassifier chainlabel dependenceComputer SciencesDatavetenskap (datalogi)Context: Multi-label classification concerns classification with multi-dimensional output. The Classifier Chain breaks the multi-label problem into multiple binary classification problems, chaining the classifiers to exploit dependencies between labels. Consequently, its performance is influenced by the chain's order. Approaches to finding advantageous chain orders have been proposed, though they are typically costly. Objectives: This study explored the use of filter model feature selection techniques to order Classifier Chains. It examined how feature selection techniques can be adapted to evaluate label dependence, how such information can be used to select a chain order and how this affects the classifier's performance and execution time. Methods: An experiment was performed to evaluate the proposed approach. The two proposed algorithms, Forward-Oriented Chain Selection (FOCS) and Backward-Oriented Chain Selection (BOCS), were tested with three different feature evaluators. 10-fold cross-validation was performed on ten benchmark datasets. Performance was measured in accuracy, 0/1 subset accuracy and Hamming loss. Execution time was measured during chain selection, classifier training and testing. Results: Both proposed algorithms led to improved accuracy and 0/1 subset accuracy (Friedman & Hochberg, p < 0.05). FOCS also improved the Hamming loss while BOCS did not. Measured effect sizes ranged from 0.20 to 1.85 percentage points. Execution time was increased by less than 3 % in most cases. Conclusions: The results showed that the proposed approach can improve the Classifier Chain's performance at a low cost. The improvements appear similar to comparable techniques in magnitude but at a lower cost. It shows that feature selection techniques can be applied to chain ordering, demonstrates the viability of the approach and establishes FOCS and BOCS as alternatives worthy of further consideration. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:bth-14817application/pdfinfo:eu-repo/semantics/openAccess |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
multi-label classification classifier chain label dependence Computer Sciences Datavetenskap (datalogi) |
spellingShingle |
multi-label classification classifier chain label dependence Computer Sciences Datavetenskap (datalogi) Gustafsson, Robin Ordering Classifier Chains using filter model feature selection techniques |
description |
Context: Multi-label classification concerns classification with multi-dimensional output. The Classifier Chain breaks the multi-label problem into multiple binary classification problems, chaining the classifiers to exploit dependencies between labels. Consequently, its performance is influenced by the chain's order. Approaches to finding advantageous chain orders have been proposed, though they are typically costly. Objectives: This study explored the use of filter model feature selection techniques to order Classifier Chains. It examined how feature selection techniques can be adapted to evaluate label dependence, how such information can be used to select a chain order and how this affects the classifier's performance and execution time. Methods: An experiment was performed to evaluate the proposed approach. The two proposed algorithms, Forward-Oriented Chain Selection (FOCS) and Backward-Oriented Chain Selection (BOCS), were tested with three different feature evaluators. 10-fold cross-validation was performed on ten benchmark datasets. Performance was measured in accuracy, 0/1 subset accuracy and Hamming loss. Execution time was measured during chain selection, classifier training and testing. Results: Both proposed algorithms led to improved accuracy and 0/1 subset accuracy (Friedman & Hochberg, p < 0.05). FOCS also improved the Hamming loss while BOCS did not. Measured effect sizes ranged from 0.20 to 1.85 percentage points. Execution time was increased by less than 3 % in most cases. Conclusions: The results showed that the proposed approach can improve the Classifier Chain's performance at a low cost. The improvements appear similar to comparable techniques in magnitude but at a lower cost. It shows that feature selection techniques can be applied to chain ordering, demonstrates the viability of the approach and establishes FOCS and BOCS as alternatives worthy of further consideration. |
author |
Gustafsson, Robin |
author_facet |
Gustafsson, Robin |
author_sort |
Gustafsson, Robin |
title |
Ordering Classifier Chains using filter model feature selection techniques |
title_short |
Ordering Classifier Chains using filter model feature selection techniques |
title_full |
Ordering Classifier Chains using filter model feature selection techniques |
title_fullStr |
Ordering Classifier Chains using filter model feature selection techniques |
title_full_unstemmed |
Ordering Classifier Chains using filter model feature selection techniques |
title_sort |
ordering classifier chains using filter model feature selection techniques |
publisher |
Blekinge Tekniska Högskola, Institutionen för datalogi och datorsystemteknik |
publishDate |
2017 |
url |
http://urn.kb.se/resolve?urn=urn:nbn:se:bth-14817 |
work_keys_str_mv |
AT gustafssonrobin orderingclassifierchainsusingfiltermodelfeatureselectiontechniques |
_version_ |
1718609613358628864 |