Feature Selection and Overlapping Clustering-Based Multilabel Classification Model
Multilabel classification (MLC) learning, which is widely applied in real-world applications, is a very important problem in machine learning. Some studies show that a clustering-based MLC framework performs effectively compared to a nonclustering framework. In this paper, we explore the clustering-...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2018-01-01
|
Series: | Mathematical Problems in Engineering |
Online Access: | http://dx.doi.org/10.1155/2018/2814897 |
id |
doaj-eeeb3443166141fab2f4dc8d5e64d261 |
---|---|
record_format |
Article |
spelling |
doaj-eeeb3443166141fab2f4dc8d5e64d2612020-11-24T22:36:21ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472018-01-01201810.1155/2018/28148972814897Feature Selection and Overlapping Clustering-Based Multilabel Classification ModelLiwen Peng0Yongguo Liu1Knowledge and Data Engineering Laboratory of Chinese Medicine, School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, ChinaKnowledge and Data Engineering Laboratory of Chinese Medicine, School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, ChinaMultilabel classification (MLC) learning, which is widely applied in real-world applications, is a very important problem in machine learning. Some studies show that a clustering-based MLC framework performs effectively compared to a nonclustering framework. In this paper, we explore the clustering-based MLC problem. Multilabel feature selection also plays an important role in classification learning because many redundant and irrelevant features can degrade performance and a good feature selection algorithm can reduce computational complexity and improve classification accuracy. In this study, we consider feature dependence and feature interaction simultaneously, and we propose a multilabel feature selection algorithm as a preprocessing stage before MLC. Typically, existing cluster-based MLC frameworks employ a hard cluster method. In practice, the instances of multilabel datasets are distinguished in a single cluster by such frameworks; however, the overlapping nature of multilabel instances is such that, in real-life applications, instances may not belong to only a single class. Therefore, we propose a MLC model that combines feature selection with an overlapping clustering algorithm. Experimental results demonstrate that various clustering algorithms show different performance for MLC, and the proposed overlapping clustering-based MLC model may be more suitable.http://dx.doi.org/10.1155/2018/2814897 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Liwen Peng Yongguo Liu |
spellingShingle |
Liwen Peng Yongguo Liu Feature Selection and Overlapping Clustering-Based Multilabel Classification Model Mathematical Problems in Engineering |
author_facet |
Liwen Peng Yongguo Liu |
author_sort |
Liwen Peng |
title |
Feature Selection and Overlapping Clustering-Based Multilabel Classification Model |
title_short |
Feature Selection and Overlapping Clustering-Based Multilabel Classification Model |
title_full |
Feature Selection and Overlapping Clustering-Based Multilabel Classification Model |
title_fullStr |
Feature Selection and Overlapping Clustering-Based Multilabel Classification Model |
title_full_unstemmed |
Feature Selection and Overlapping Clustering-Based Multilabel Classification Model |
title_sort |
feature selection and overlapping clustering-based multilabel classification model |
publisher |
Hindawi Limited |
series |
Mathematical Problems in Engineering |
issn |
1024-123X 1563-5147 |
publishDate |
2018-01-01 |
description |
Multilabel classification (MLC) learning, which is widely applied in real-world applications, is a very important problem in machine learning. Some studies show that a clustering-based MLC framework performs effectively compared to a nonclustering framework. In this paper, we explore the clustering-based MLC problem. Multilabel feature selection also plays an important role in classification learning because many redundant and irrelevant features can degrade performance and a good feature selection algorithm can reduce computational complexity and improve classification accuracy. In this study, we consider feature dependence and feature interaction simultaneously, and we propose a multilabel feature selection algorithm as a preprocessing stage before MLC. Typically, existing cluster-based MLC frameworks employ a hard cluster method. In practice, the instances of multilabel datasets are distinguished in a single cluster by such frameworks; however, the overlapping nature of multilabel instances is such that, in real-life applications, instances may not belong to only a single class. Therefore, we propose a MLC model that combines feature selection with an overlapping clustering algorithm. Experimental results demonstrate that various clustering algorithms show different performance for MLC, and the proposed overlapping clustering-based MLC model may be more suitable. |
url |
http://dx.doi.org/10.1155/2018/2814897 |
work_keys_str_mv |
AT liwenpeng featureselectionandoverlappingclusteringbasedmultilabelclassificationmodel AT yongguoliu featureselectionandoverlappingclusteringbasedmultilabelclassificationmodel |
_version_ |
1725720716531204096 |