A Self-Adaptive Fuzzy c-Means Algorithm for Determining the Optimal Number of Clusters

For the shortcoming of fuzzy c-means algorithm (FCM) needing to know the number of clusters in advance, this paper proposed a new self-adaptive method to determine the optimal number of clusters. Firstly, a density-based algorithm was put forward. The algorithm, according to the characteristics of t...

Full description

Bibliographic Details
Main Authors: Min Ren, Peiyu Liu, Zhihao Wang, Jing Yi
Format: Article
Language:English
Published: Hindawi Limited 2016-01-01
Series:Computational Intelligence and Neuroscience
Online Access:http://dx.doi.org/10.1155/2016/2647389
id doaj-928126b1f5534962a266bdb1da6f51d5
record_format Article
spelling doaj-928126b1f5534962a266bdb1da6f51d52020-11-24T21:04:42ZengHindawi LimitedComputational Intelligence and Neuroscience1687-52651687-52732016-01-01201610.1155/2016/26473892647389A Self-Adaptive Fuzzy c-Means Algorithm for Determining the Optimal Number of ClustersMin Ren0Peiyu Liu1Zhihao Wang2Jing Yi3School of Information Science and Engineering, Shandong Normal University, Jinan, Shandong, ChinaSchool of Information Science and Engineering, Shandong Normal University, Jinan, Shandong, ChinaSchool of Information Science and Engineering, Shandong Normal University, Jinan, Shandong, ChinaSchool of Information Science and Engineering, Shandong Normal University, Jinan, Shandong, ChinaFor the shortcoming of fuzzy c-means algorithm (FCM) needing to know the number of clusters in advance, this paper proposed a new self-adaptive method to determine the optimal number of clusters. Firstly, a density-based algorithm was put forward. The algorithm, according to the characteristics of the dataset, automatically determined the possible maximum number of clusters instead of using the empirical rule n and obtained the optimal initial cluster centroids, improving the limitation of FCM that randomly selected cluster centroids lead the convergence result to the local minimum. Secondly, this paper, by introducing a penalty function, proposed a new fuzzy clustering validity index based on fuzzy compactness and separation, which ensured that when the number of clusters verged on that of objects in the dataset, the value of clustering validity index did not monotonically decrease and was close to zero, so that the optimal number of clusters lost robustness and decision function. Then, based on these studies, a self-adaptive FCM algorithm was put forward to estimate the optimal number of clusters by the iterative trial-and-error process. At last, experiments were done on the UCI, KDD Cup 1999, and synthetic datasets, which showed that the method not only effectively determined the optimal number of clusters, but also reduced the iteration of FCM with the stable clustering result.http://dx.doi.org/10.1155/2016/2647389
collection DOAJ
language English
format Article
sources DOAJ
author Min Ren
Peiyu Liu
Zhihao Wang
Jing Yi
spellingShingle Min Ren
Peiyu Liu
Zhihao Wang
Jing Yi
A Self-Adaptive Fuzzy c-Means Algorithm for Determining the Optimal Number of Clusters
Computational Intelligence and Neuroscience
author_facet Min Ren
Peiyu Liu
Zhihao Wang
Jing Yi
author_sort Min Ren
title A Self-Adaptive Fuzzy c-Means Algorithm for Determining the Optimal Number of Clusters
title_short A Self-Adaptive Fuzzy c-Means Algorithm for Determining the Optimal Number of Clusters
title_full A Self-Adaptive Fuzzy c-Means Algorithm for Determining the Optimal Number of Clusters
title_fullStr A Self-Adaptive Fuzzy c-Means Algorithm for Determining the Optimal Number of Clusters
title_full_unstemmed A Self-Adaptive Fuzzy c-Means Algorithm for Determining the Optimal Number of Clusters
title_sort self-adaptive fuzzy c-means algorithm for determining the optimal number of clusters
publisher Hindawi Limited
series Computational Intelligence and Neuroscience
issn 1687-5265
1687-5273
publishDate 2016-01-01
description For the shortcoming of fuzzy c-means algorithm (FCM) needing to know the number of clusters in advance, this paper proposed a new self-adaptive method to determine the optimal number of clusters. Firstly, a density-based algorithm was put forward. The algorithm, according to the characteristics of the dataset, automatically determined the possible maximum number of clusters instead of using the empirical rule n and obtained the optimal initial cluster centroids, improving the limitation of FCM that randomly selected cluster centroids lead the convergence result to the local minimum. Secondly, this paper, by introducing a penalty function, proposed a new fuzzy clustering validity index based on fuzzy compactness and separation, which ensured that when the number of clusters verged on that of objects in the dataset, the value of clustering validity index did not monotonically decrease and was close to zero, so that the optimal number of clusters lost robustness and decision function. Then, based on these studies, a self-adaptive FCM algorithm was put forward to estimate the optimal number of clusters by the iterative trial-and-error process. At last, experiments were done on the UCI, KDD Cup 1999, and synthetic datasets, which showed that the method not only effectively determined the optimal number of clusters, but also reduced the iteration of FCM with the stable clustering result.
url http://dx.doi.org/10.1155/2016/2647389
work_keys_str_mv AT minren aselfadaptivefuzzycmeansalgorithmfordeterminingtheoptimalnumberofclusters
AT peiyuliu aselfadaptivefuzzycmeansalgorithmfordeterminingtheoptimalnumberofclusters
AT zhihaowang aselfadaptivefuzzycmeansalgorithmfordeterminingtheoptimalnumberofclusters
AT jingyi aselfadaptivefuzzycmeansalgorithmfordeterminingtheoptimalnumberofclusters
AT minren selfadaptivefuzzycmeansalgorithmfordeterminingtheoptimalnumberofclusters
AT peiyuliu selfadaptivefuzzycmeansalgorithmfordeterminingtheoptimalnumberofclusters
AT zhihaowang selfadaptivefuzzycmeansalgorithmfordeterminingtheoptimalnumberofclusters
AT jingyi selfadaptivefuzzycmeansalgorithmfordeterminingtheoptimalnumberofclusters
_version_ 1716770180612227072