Multiple Cooperative Swarms for Data Clustering

Exploring a set of unlabeled data to extract the similar clusters, known as data clustering, is an appealing problem in machine learning. In other words, data clustering organizes the underlying data into different groups using a notion of similarity between patterns. A new approach to solve the da...

Full description

Bibliographic Details
Main Author: Ahmadi, Abbas
Language:en
Published: 2008
Subjects:
Online Access:http://hdl.handle.net/10012/4031
id ndltd-LACETR-oai-collectionscanada.gc.ca-OWTU.10012-4031
record_format oai_dc
spelling ndltd-LACETR-oai-collectionscanada.gc.ca-OWTU.10012-40312013-10-04T04:08:40ZAhmadi, Abbas2008-09-24T20:23:04Z2008-09-24T20:23:04Z2008-09-24T20:23:04Z2008http://hdl.handle.net/10012/4031Exploring a set of unlabeled data to extract the similar clusters, known as data clustering, is an appealing problem in machine learning. In other words, data clustering organizes the underlying data into different groups using a notion of similarity between patterns. A new approach to solve the data clustering problem based on multiple cooperative swarms is introduced. The proposed approach is inspired by the social swarming behavior of biological bird flocks which search for food situated in several places. The proposed approach is composed of two main phases, namely, initialization and exploitation. In the initialization phase, the aim is to distribute the search space among several swarms. That is, a part of the search space is assigned to each swarm in this phase. In the exploitation phase, each swarm searches for the center of its associated cluster while cooperating with other swarms. The search proceeds to converge to a near-optimal solution. As compared to the single swarm clustering approach, the proposed multiple cooperative swarms provide better solutions in terms of fitness function measure for the cluster centers, as the dimensionality of data and number of clusters increase. The multiple cooperative swarms clustering approach assumes that the number of clusters is known a priori. The notion of stability analysis is proposed to extract the number of clusters for the underlying data using multiple cooperative swarms. The mathematical explanations demonstrating why the proposed approach leads to more stable and robust results than those of the single swarm clustering are also provided. Application of the proposed multiple cooperative swarms clustering is considered for one of the most challenging problems in speech recognition: phoneme recognition. The proposed approach is used to decompose the recognition task into a number of subtasks or modules. Each module involves a set of similar phonemes known as a phoneme family. Basically, the goal is to obtain the best solution for phoneme families using the proposed multiple cooperative swarms clustering. The experiments using the standard TIMIT corpus indicate that using the proposed clustering approach boosts the accuracy of the modular approach for phoneme recognition considerably.enData clusteringSwarm intelligenceParticle swarm optimizationCooperative swarmsMultiple Cooperative Swarms for Data ClusteringThesis or DissertationSystems Design EngineeringDoctor of PhilosophySystem Design Engineering
collection NDLTD
language en
sources NDLTD
topic Data clustering
Swarm intelligence
Particle swarm optimization
Cooperative swarms
System Design Engineering
spellingShingle Data clustering
Swarm intelligence
Particle swarm optimization
Cooperative swarms
System Design Engineering
Ahmadi, Abbas
Multiple Cooperative Swarms for Data Clustering
description Exploring a set of unlabeled data to extract the similar clusters, known as data clustering, is an appealing problem in machine learning. In other words, data clustering organizes the underlying data into different groups using a notion of similarity between patterns. A new approach to solve the data clustering problem based on multiple cooperative swarms is introduced. The proposed approach is inspired by the social swarming behavior of biological bird flocks which search for food situated in several places. The proposed approach is composed of two main phases, namely, initialization and exploitation. In the initialization phase, the aim is to distribute the search space among several swarms. That is, a part of the search space is assigned to each swarm in this phase. In the exploitation phase, each swarm searches for the center of its associated cluster while cooperating with other swarms. The search proceeds to converge to a near-optimal solution. As compared to the single swarm clustering approach, the proposed multiple cooperative swarms provide better solutions in terms of fitness function measure for the cluster centers, as the dimensionality of data and number of clusters increase. The multiple cooperative swarms clustering approach assumes that the number of clusters is known a priori. The notion of stability analysis is proposed to extract the number of clusters for the underlying data using multiple cooperative swarms. The mathematical explanations demonstrating why the proposed approach leads to more stable and robust results than those of the single swarm clustering are also provided. Application of the proposed multiple cooperative swarms clustering is considered for one of the most challenging problems in speech recognition: phoneme recognition. The proposed approach is used to decompose the recognition task into a number of subtasks or modules. Each module involves a set of similar phonemes known as a phoneme family. Basically, the goal is to obtain the best solution for phoneme families using the proposed multiple cooperative swarms clustering. The experiments using the standard TIMIT corpus indicate that using the proposed clustering approach boosts the accuracy of the modular approach for phoneme recognition considerably.
author Ahmadi, Abbas
author_facet Ahmadi, Abbas
author_sort Ahmadi, Abbas
title Multiple Cooperative Swarms for Data Clustering
title_short Multiple Cooperative Swarms for Data Clustering
title_full Multiple Cooperative Swarms for Data Clustering
title_fullStr Multiple Cooperative Swarms for Data Clustering
title_full_unstemmed Multiple Cooperative Swarms for Data Clustering
title_sort multiple cooperative swarms for data clustering
publishDate 2008
url http://hdl.handle.net/10012/4031
work_keys_str_mv AT ahmadiabbas multiplecooperativeswarmsfordataclustering
_version_ 1716600075229069312