Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble Clustering

Clustering is a classic combined optimization problem that is widely used in pattern recognition, image processing, market analysis and so on. However, the efficiency of clustering algorithms decreases as the amount of data increases. In addition, most of the existing methods optimize only one objec...

Full description

Bibliographic Details
Main Authors: Shuo Yan, Yuan Wang, Deting Kong, Jinyan Hu, Jianhua Qu, Xiyu Liu, Jie Xue
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8853276/
id doaj-d4aceeb09aed4ac4b4b633b59884409d
record_format Article
spelling doaj-d4aceeb09aed4ac4b4b633b59884409d2021-04-05T17:23:16ZengIEEEIEEE Access2169-35362019-01-01714351114352310.1109/ACCESS.2019.29446758853276Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble ClusteringShuo Yan0Yuan Wang1Deting Kong2Jinyan Hu3Jianhua Qu4Xiyu Liu5Jie Xue6https://orcid.org/0000-0002-4952-5583Business School, Shandong Normal University, Jinan, ChinaBusiness School, Shandong Normal University, Jinan, ChinaBusiness School, Shandong Normal University, Jinan, ChinaBusiness School, Shandong Normal University, Jinan, ChinaBusiness School, Shandong Normal University, Jinan, ChinaBusiness School, Shandong Normal University, Jinan, ChinaBusiness School, Shandong Normal University, Jinan, ChinaClustering is a classic combined optimization problem that is widely used in pattern recognition, image processing, market analysis and so on. However, the efficiency of clustering algorithms decreases as the amount of data increases. In addition, most of the existing methods optimize only one objective and therefore may be suitable only for datasets with certain features. To address these limitations, in this paper, we develop a new hybrid chain-hypergraph P system (named HCHPS), which makes full use of the parallelism of P systems as well as the advantages of chain and hypergraph topology structures for accurate and efficient clustering. Our new P system comprises three types of subsystems, i.e., reaction chain membrane subsystems, local communication membrane subsystems and global ensemble membrane subsystems. Each type of subsystems is implemented end-to-end in HCHPS with new rules and membrane structures in parallel. In particular, to obtain efficient clustering center objects and make the algorithm robust to data with various features, the reaction chain membrane subsystems perform three different multiobjective strategies simultaneously by new chain evolution rules. To increase the population diversity of cluster centers, the local communication membrane subsystems utilize transport rules between membranes for coevolution of nondominated objects. The global ensemble membrane subsystems conduct a new dense representation multisize ensemble strategy to further improve the accuracy of the final results. Evaluations on two artificial data sets and 17 real-life data sets demonstrate the robustness of the proposed method in correctly clustering data sets with different dimensions and shapes. Our experimental results outperform those of both baseline and state-of-the-art methods. Moreover, benefiting from the parallelism, HCHPS is less time consuming than other methods, featuring an average completion time of 28.07 seconds on the 17 real-life data sets. Moreover, an ablation study shows that our proposed components are critical for effective cluster analysis.https://ieeexplore.ieee.org/document/8853276/Chain-hypergraph P systemmultiobjective optimizationcluster analysis
collection DOAJ
language English
format Article
sources DOAJ
author Shuo Yan
Yuan Wang
Deting Kong
Jinyan Hu
Jianhua Qu
Xiyu Liu
Jie Xue
spellingShingle Shuo Yan
Yuan Wang
Deting Kong
Jinyan Hu
Jianhua Qu
Xiyu Liu
Jie Xue
Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble Clustering
IEEE Access
Chain-hypergraph P system
multiobjective optimization
cluster analysis
author_facet Shuo Yan
Yuan Wang
Deting Kong
Jinyan Hu
Jianhua Qu
Xiyu Liu
Jie Xue
author_sort Shuo Yan
title Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble Clustering
title_short Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble Clustering
title_full Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble Clustering
title_fullStr Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble Clustering
title_full_unstemmed Hybrid Chain-Hypergraph P Systems for Multiobjective Ensemble Clustering
title_sort hybrid chain-hypergraph p systems for multiobjective ensemble clustering
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Clustering is a classic combined optimization problem that is widely used in pattern recognition, image processing, market analysis and so on. However, the efficiency of clustering algorithms decreases as the amount of data increases. In addition, most of the existing methods optimize only one objective and therefore may be suitable only for datasets with certain features. To address these limitations, in this paper, we develop a new hybrid chain-hypergraph P system (named HCHPS), which makes full use of the parallelism of P systems as well as the advantages of chain and hypergraph topology structures for accurate and efficient clustering. Our new P system comprises three types of subsystems, i.e., reaction chain membrane subsystems, local communication membrane subsystems and global ensemble membrane subsystems. Each type of subsystems is implemented end-to-end in HCHPS with new rules and membrane structures in parallel. In particular, to obtain efficient clustering center objects and make the algorithm robust to data with various features, the reaction chain membrane subsystems perform three different multiobjective strategies simultaneously by new chain evolution rules. To increase the population diversity of cluster centers, the local communication membrane subsystems utilize transport rules between membranes for coevolution of nondominated objects. The global ensemble membrane subsystems conduct a new dense representation multisize ensemble strategy to further improve the accuracy of the final results. Evaluations on two artificial data sets and 17 real-life data sets demonstrate the robustness of the proposed method in correctly clustering data sets with different dimensions and shapes. Our experimental results outperform those of both baseline and state-of-the-art methods. Moreover, benefiting from the parallelism, HCHPS is less time consuming than other methods, featuring an average completion time of 28.07 seconds on the 17 real-life data sets. Moreover, an ablation study shows that our proposed components are critical for effective cluster analysis.
topic Chain-hypergraph P system
multiobjective optimization
cluster analysis
url https://ieeexplore.ieee.org/document/8853276/
work_keys_str_mv AT shuoyan hybridchainhypergraphpsystemsformultiobjectiveensembleclustering
AT yuanwang hybridchainhypergraphpsystemsformultiobjectiveensembleclustering
AT detingkong hybridchainhypergraphpsystemsformultiobjectiveensembleclustering
AT jinyanhu hybridchainhypergraphpsystemsformultiobjectiveensembleclustering
AT jianhuaqu hybridchainhypergraphpsystemsformultiobjectiveensembleclustering
AT xiyuliu hybridchainhypergraphpsystemsformultiobjectiveensembleclustering
AT jiexue hybridchainhypergraphpsystemsformultiobjectiveensembleclustering
_version_ 1721539727468789760