Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble Clustering
Clustering is a classic combined optimization problem that is widely used in pattern recognition, image processing, market analysis and so on. However, the efficiency of clustering algorithms decreases as the amount of data increases. In addition, most of the existing methods optimize only one objec...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8853276/ |
id |
doaj-d4aceeb09aed4ac4b4b633b59884409d |
---|---|
record_format |
Article |
spelling |
doaj-d4aceeb09aed4ac4b4b633b59884409d2021-04-05T17:23:16ZengIEEEIEEE Access2169-35362019-01-01714351114352310.1109/ACCESS.2019.29446758853276Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble ClusteringShuo Yan0Yuan Wang1Deting Kong2Jinyan Hu3Jianhua Qu4Xiyu Liu5Jie Xue6https://orcid.org/0000-0002-4952-5583Business School, Shandong Normal University, Jinan, ChinaBusiness School, Shandong Normal University, Jinan, ChinaBusiness School, Shandong Normal University, Jinan, ChinaBusiness School, Shandong Normal University, Jinan, ChinaBusiness School, Shandong Normal University, Jinan, ChinaBusiness School, Shandong Normal University, Jinan, ChinaBusiness School, Shandong Normal University, Jinan, ChinaClustering is a classic combined optimization problem that is widely used in pattern recognition, image processing, market analysis and so on. However, the efficiency of clustering algorithms decreases as the amount of data increases. In addition, most of the existing methods optimize only one objective and therefore may be suitable only for datasets with certain features. To address these limitations, in this paper, we develop a new hybrid chain-hypergraph P system (named HCHPS), which makes full use of the parallelism of P systems as well as the advantages of chain and hypergraph topology structures for accurate and efficient clustering. Our new P system comprises three types of subsystems, i.e., reaction chain membrane subsystems, local communication membrane subsystems and global ensemble membrane subsystems. Each type of subsystems is implemented end-to-end in HCHPS with new rules and membrane structures in parallel. In particular, to obtain efficient clustering center objects and make the algorithm robust to data with various features, the reaction chain membrane subsystems perform three different multiobjective strategies simultaneously by new chain evolution rules. To increase the population diversity of cluster centers, the local communication membrane subsystems utilize transport rules between membranes for coevolution of nondominated objects. The global ensemble membrane subsystems conduct a new dense representation multisize ensemble strategy to further improve the accuracy of the final results. Evaluations on two artificial data sets and 17 real-life data sets demonstrate the robustness of the proposed method in correctly clustering data sets with different dimensions and shapes. Our experimental results outperform those of both baseline and state-of-the-art methods. Moreover, benefiting from the parallelism, HCHPS is less time consuming than other methods, featuring an average completion time of 28.07 seconds on the 17 real-life data sets. Moreover, an ablation study shows that our proposed components are critical for effective cluster analysis.https://ieeexplore.ieee.org/document/8853276/Chain-hypergraph P systemmultiobjective optimizationcluster analysis |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Shuo Yan Yuan Wang Deting Kong Jinyan Hu Jianhua Qu Xiyu Liu Jie Xue |
spellingShingle |
Shuo Yan Yuan Wang Deting Kong Jinyan Hu Jianhua Qu Xiyu Liu Jie Xue Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble Clustering IEEE Access Chain-hypergraph P system multiobjective optimization cluster analysis |
author_facet |
Shuo Yan Yuan Wang Deting Kong Jinyan Hu Jianhua Qu Xiyu Liu Jie Xue |
author_sort |
Shuo Yan |
title |
Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble Clustering |
title_short |
Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble Clustering |
title_full |
Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble Clustering |
title_fullStr |
Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble Clustering |
title_full_unstemmed |
Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble Clustering |
title_sort |
hybrid chain-hypergraph p systems for multiobjective ensemble clustering |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
Clustering is a classic combined optimization problem that is widely used in pattern recognition, image processing, market analysis and so on. However, the efficiency of clustering algorithms decreases as the amount of data increases. In addition, most of the existing methods optimize only one objective and therefore may be suitable only for datasets with certain features. To address these limitations, in this paper, we develop a new hybrid chain-hypergraph P system (named HCHPS), which makes full use of the parallelism of P systems as well as the advantages of chain and hypergraph topology structures for accurate and efficient clustering. Our new P system comprises three types of subsystems, i.e., reaction chain membrane subsystems, local communication membrane subsystems and global ensemble membrane subsystems. Each type of subsystems is implemented end-to-end in HCHPS with new rules and membrane structures in parallel. In particular, to obtain efficient clustering center objects and make the algorithm robust to data with various features, the reaction chain membrane subsystems perform three different multiobjective strategies simultaneously by new chain evolution rules. To increase the population diversity of cluster centers, the local communication membrane subsystems utilize transport rules between membranes for coevolution of nondominated objects. The global ensemble membrane subsystems conduct a new dense representation multisize ensemble strategy to further improve the accuracy of the final results. Evaluations on two artificial data sets and 17 real-life data sets demonstrate the robustness of the proposed method in correctly clustering data sets with different dimensions and shapes. Our experimental results outperform those of both baseline and state-of-the-art methods. Moreover, benefiting from the parallelism, HCHPS is less time consuming than other methods, featuring an average completion time of 28.07 seconds on the 17 real-life data sets. Moreover, an ablation study shows that our proposed components are critical for effective cluster analysis. |
topic |
Chain-hypergraph P system multiobjective optimization cluster analysis |
url |
https://ieeexplore.ieee.org/document/8853276/ |
work_keys_str_mv |
AT shuoyan hybridchainhypergraphpsystemsformultiobjectiveensembleclustering AT yuanwang hybridchainhypergraphpsystemsformultiobjectiveensembleclustering AT detingkong hybridchainhypergraphpsystemsformultiobjectiveensembleclustering AT jinyanhu hybridchainhypergraphpsystemsformultiobjectiveensembleclustering AT jianhuaqu hybridchainhypergraphpsystemsformultiobjectiveensembleclustering AT xiyuliu hybridchainhypergraphpsystemsformultiobjectiveensembleclustering AT jiexue hybridchainhypergraphpsystemsformultiobjectiveensembleclustering |
_version_ |
1721539727468789760 |