CLUSTERING CATEGORICAL DATA USING k-MODES BASED ON CUCKOO SEARCH OPTIMIZATION ALGORITHM

Cluster analysis is the unsupervised learning technique that finds the interesting patterns in the data objects without knowing class labels. Most of the real world dataset consists of categorical data. For example, social media analysis may have the categorical data like the gender as male or femal...

Full description

Bibliographic Details
Main Authors: Lakshmi K, Karthikeyani Visalakshi, S Shanthi, S Parvathavarthini
Format: Article
Language:English
Published: ICT Academy of Tamil Nadu 2017-10-01
Series:ICTACT Journal on Communication Technology
Subjects:
Online Access:http://ictactjournals.in/ArticleDetails.aspx?id=3187
id doaj-a110ebe3789e40e79cca85656a812a0c
record_format Article
spelling doaj-a110ebe3789e40e79cca85656a812a0c2020-11-24T23:51:17ZengICT Academy of Tamil NaduICTACT Journal on Communication Technology0976-65612229-69482017-10-01811561156610.21917/ijsc.2017.0218CLUSTERING CATEGORICAL DATA USING k-MODES BASED ON CUCKOO SEARCH OPTIMIZATION ALGORITHMLakshmi K0 Karthikeyani Visalakshi1S Shanthi2S Parvathavarthini3Kongu Engineering College, IndiaNKR Government Arts College for Women, IndiaKongu Engineering College, IndiaKongu Engineering College, IndiaCluster analysis is the unsupervised learning technique that finds the interesting patterns in the data objects without knowing class labels. Most of the real world dataset consists of categorical data. For example, social media analysis may have the categorical data like the gender as male or female. The k-modes clustering algorithm is the most widely used to group the categorical data, because it is easy to implement and efficient to handle the large amount of data. However, due to its random selection of initial centroids, it provides the local optimum solution. There are number of optimization algorithms are developed to obtain global optimum solution. Cuckoo Search algorithm is the population based metaheuristic optimization algorithms to provide the global optimum solution. Methods: In this paper, k-modes clustering algorithm is combined with Cuckoo Search algorithm to obtain the global optimum solution. Results: Experiments are conducted with benchmark datasets and the results are compared with k-modes and Particle Swarm Optimization with k-modes to prove the efficiency of the proposed algorithm. http://ictactjournals.in/ArticleDetails.aspx?id=3187Cluster Analysisk-ModesCuckoo Search OptimizationLocal OptimaInitial Centroids
collection DOAJ
language English
format Article
sources DOAJ
author Lakshmi K
Karthikeyani Visalakshi
S Shanthi
S Parvathavarthini
spellingShingle Lakshmi K
Karthikeyani Visalakshi
S Shanthi
S Parvathavarthini
CLUSTERING CATEGORICAL DATA USING k-MODES BASED ON CUCKOO SEARCH OPTIMIZATION ALGORITHM
ICTACT Journal on Communication Technology
Cluster Analysis
k-Modes
Cuckoo Search Optimization
Local Optima
Initial Centroids
author_facet Lakshmi K
Karthikeyani Visalakshi
S Shanthi
S Parvathavarthini
author_sort Lakshmi K
title CLUSTERING CATEGORICAL DATA USING k-MODES BASED ON CUCKOO SEARCH OPTIMIZATION ALGORITHM
title_short CLUSTERING CATEGORICAL DATA USING k-MODES BASED ON CUCKOO SEARCH OPTIMIZATION ALGORITHM
title_full CLUSTERING CATEGORICAL DATA USING k-MODES BASED ON CUCKOO SEARCH OPTIMIZATION ALGORITHM
title_fullStr CLUSTERING CATEGORICAL DATA USING k-MODES BASED ON CUCKOO SEARCH OPTIMIZATION ALGORITHM
title_full_unstemmed CLUSTERING CATEGORICAL DATA USING k-MODES BASED ON CUCKOO SEARCH OPTIMIZATION ALGORITHM
title_sort clustering categorical data using k-modes based on cuckoo search optimization algorithm
publisher ICT Academy of Tamil Nadu
series ICTACT Journal on Communication Technology
issn 0976-6561
2229-6948
publishDate 2017-10-01
description Cluster analysis is the unsupervised learning technique that finds the interesting patterns in the data objects without knowing class labels. Most of the real world dataset consists of categorical data. For example, social media analysis may have the categorical data like the gender as male or female. The k-modes clustering algorithm is the most widely used to group the categorical data, because it is easy to implement and efficient to handle the large amount of data. However, due to its random selection of initial centroids, it provides the local optimum solution. There are number of optimization algorithms are developed to obtain global optimum solution. Cuckoo Search algorithm is the population based metaheuristic optimization algorithms to provide the global optimum solution. Methods: In this paper, k-modes clustering algorithm is combined with Cuckoo Search algorithm to obtain the global optimum solution. Results: Experiments are conducted with benchmark datasets and the results are compared with k-modes and Particle Swarm Optimization with k-modes to prove the efficiency of the proposed algorithm.
topic Cluster Analysis
k-Modes
Cuckoo Search Optimization
Local Optima
Initial Centroids
url http://ictactjournals.in/ArticleDetails.aspx?id=3187
work_keys_str_mv AT lakshmik clusteringcategoricaldatausingkmodesbasedoncuckoosearchoptimizationalgorithm
AT karthikeyanivisalakshi clusteringcategoricaldatausingkmodesbasedoncuckoosearchoptimizationalgorithm
AT sshanthi clusteringcategoricaldatausingkmodesbasedoncuckoosearchoptimizationalgorithm
AT sparvathavarthini clusteringcategoricaldatausingkmodesbasedoncuckoosearchoptimizationalgorithm
_version_ 1725476616114536448