Feature Selection and Overlapping Clustering-Based Multilabel Classification Model

Multilabel classification (MLC) learning, which is widely applied in real-world applications, is a very important problem in machine learning. Some studies show that a clustering-based MLC framework performs effectively compared to a nonclustering framework. In this paper, we explore the clustering-...

Full description

Bibliographic Details
Main Authors: Liwen Peng, Yongguo Liu
Format: Article
Language:English
Published: Hindawi Limited 2018-01-01
Series:Mathematical Problems in Engineering
Online Access:http://dx.doi.org/10.1155/2018/2814897
id doaj-eeeb3443166141fab2f4dc8d5e64d261
record_format Article
spelling doaj-eeeb3443166141fab2f4dc8d5e64d2612020-11-24T22:36:21ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472018-01-01201810.1155/2018/28148972814897Feature Selection and Overlapping Clustering-Based Multilabel Classification ModelLiwen Peng0Yongguo Liu1Knowledge and Data Engineering Laboratory of Chinese Medicine, School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, ChinaKnowledge and Data Engineering Laboratory of Chinese Medicine, School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, ChinaMultilabel classification (MLC) learning, which is widely applied in real-world applications, is a very important problem in machine learning. Some studies show that a clustering-based MLC framework performs effectively compared to a nonclustering framework. In this paper, we explore the clustering-based MLC problem. Multilabel feature selection also plays an important role in classification learning because many redundant and irrelevant features can degrade performance and a good feature selection algorithm can reduce computational complexity and improve classification accuracy. In this study, we consider feature dependence and feature interaction simultaneously, and we propose a multilabel feature selection algorithm as a preprocessing stage before MLC. Typically, existing cluster-based MLC frameworks employ a hard cluster method. In practice, the instances of multilabel datasets are distinguished in a single cluster by such frameworks; however, the overlapping nature of multilabel instances is such that, in real-life applications, instances may not belong to only a single class. Therefore, we propose a MLC model that combines feature selection with an overlapping clustering algorithm. Experimental results demonstrate that various clustering algorithms show different performance for MLC, and the proposed overlapping clustering-based MLC model may be more suitable.http://dx.doi.org/10.1155/2018/2814897
collection DOAJ
language English
format Article
sources DOAJ
author Liwen Peng
Yongguo Liu
spellingShingle Liwen Peng
Yongguo Liu
Feature Selection and Overlapping Clustering-Based Multilabel Classification Model
Mathematical Problems in Engineering
author_facet Liwen Peng
Yongguo Liu
author_sort Liwen Peng
title Feature Selection and Overlapping Clustering-Based Multilabel Classification Model
title_short Feature Selection and Overlapping Clustering-Based Multilabel Classification Model
title_full Feature Selection and Overlapping Clustering-Based Multilabel Classification Model
title_fullStr Feature Selection and Overlapping Clustering-Based Multilabel Classification Model
title_full_unstemmed Feature Selection and Overlapping Clustering-Based Multilabel Classification Model
title_sort feature selection and overlapping clustering-based multilabel classification model
publisher Hindawi Limited
series Mathematical Problems in Engineering
issn 1024-123X
1563-5147
publishDate 2018-01-01
description Multilabel classification (MLC) learning, which is widely applied in real-world applications, is a very important problem in machine learning. Some studies show that a clustering-based MLC framework performs effectively compared to a nonclustering framework. In this paper, we explore the clustering-based MLC problem. Multilabel feature selection also plays an important role in classification learning because many redundant and irrelevant features can degrade performance and a good feature selection algorithm can reduce computational complexity and improve classification accuracy. In this study, we consider feature dependence and feature interaction simultaneously, and we propose a multilabel feature selection algorithm as a preprocessing stage before MLC. Typically, existing cluster-based MLC frameworks employ a hard cluster method. In practice, the instances of multilabel datasets are distinguished in a single cluster by such frameworks; however, the overlapping nature of multilabel instances is such that, in real-life applications, instances may not belong to only a single class. Therefore, we propose a MLC model that combines feature selection with an overlapping clustering algorithm. Experimental results demonstrate that various clustering algorithms show different performance for MLC, and the proposed overlapping clustering-based MLC model may be more suitable.
url http://dx.doi.org/10.1155/2018/2814897
work_keys_str_mv AT liwenpeng featureselectionandoverlappingclusteringbasedmultilabelclassificationmodel
AT yongguoliu featureselectionandoverlappingclusteringbasedmultilabelclassificationmodel
_version_ 1725720716531204096