Multiple Cooperative Swarms for Data Clustering
Exploring a set of unlabeled data to extract the similar clusters, known as data clustering, is an appealing problem in machine learning. In other words, data clustering organizes the underlying data into different groups using a notion of similarity between patterns. A new approach to solve the da...
Main Author: | |
---|---|
Language: | en |
Published: |
2008
|
Subjects: | |
Online Access: | http://hdl.handle.net/10012/4031 |
id |
ndltd-LACETR-oai-collectionscanada.gc.ca-OWTU.10012-4031 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-LACETR-oai-collectionscanada.gc.ca-OWTU.10012-40312013-10-04T04:08:40ZAhmadi, Abbas2008-09-24T20:23:04Z2008-09-24T20:23:04Z2008-09-24T20:23:04Z2008http://hdl.handle.net/10012/4031Exploring a set of unlabeled data to extract the similar clusters, known as data clustering, is an appealing problem in machine learning. In other words, data clustering organizes the underlying data into different groups using a notion of similarity between patterns. A new approach to solve the data clustering problem based on multiple cooperative swarms is introduced. The proposed approach is inspired by the social swarming behavior of biological bird flocks which search for food situated in several places. The proposed approach is composed of two main phases, namely, initialization and exploitation. In the initialization phase, the aim is to distribute the search space among several swarms. That is, a part of the search space is assigned to each swarm in this phase. In the exploitation phase, each swarm searches for the center of its associated cluster while cooperating with other swarms. The search proceeds to converge to a near-optimal solution. As compared to the single swarm clustering approach, the proposed multiple cooperative swarms provide better solutions in terms of fitness function measure for the cluster centers, as the dimensionality of data and number of clusters increase. The multiple cooperative swarms clustering approach assumes that the number of clusters is known a priori. The notion of stability analysis is proposed to extract the number of clusters for the underlying data using multiple cooperative swarms. The mathematical explanations demonstrating why the proposed approach leads to more stable and robust results than those of the single swarm clustering are also provided. Application of the proposed multiple cooperative swarms clustering is considered for one of the most challenging problems in speech recognition: phoneme recognition. The proposed approach is used to decompose the recognition task into a number of subtasks or modules. Each module involves a set of similar phonemes known as a phoneme family. Basically, the goal is to obtain the best solution for phoneme families using the proposed multiple cooperative swarms clustering. The experiments using the standard TIMIT corpus indicate that using the proposed clustering approach boosts the accuracy of the modular approach for phoneme recognition considerably.enData clusteringSwarm intelligenceParticle swarm optimizationCooperative swarmsMultiple Cooperative Swarms for Data ClusteringThesis or DissertationSystems Design EngineeringDoctor of PhilosophySystem Design Engineering |
collection |
NDLTD |
language |
en |
sources |
NDLTD |
topic |
Data clustering Swarm intelligence Particle swarm optimization Cooperative swarms System Design Engineering |
spellingShingle |
Data clustering Swarm intelligence Particle swarm optimization Cooperative swarms System Design Engineering Ahmadi, Abbas Multiple Cooperative Swarms for Data Clustering |
description |
Exploring a set of unlabeled data to extract the similar clusters,
known as data clustering, is an appealing problem in machine
learning. In other words, data clustering organizes the underlying
data into different groups using a notion of similarity between
patterns.
A new approach to solve the data clustering problem based on
multiple cooperative swarms is introduced. The proposed approach is
inspired by the social swarming behavior of biological bird flocks
which search for food situated in several places. The proposed
approach is composed of two main phases, namely, initialization and
exploitation. In the initialization phase, the aim is to distribute
the search space among several swarms. That is, a part of the search
space is assigned to each swarm in this phase. In the exploitation
phase, each swarm searches for the center of its associated cluster
while cooperating with other swarms. The search proceeds to converge
to a near-optimal solution. As compared to the single swarm
clustering approach, the proposed multiple cooperative swarms
provide better solutions in terms of fitness function measure for
the cluster centers, as the dimensionality of data and number of
clusters increase.
The multiple cooperative swarms clustering approach assumes that the
number of clusters is known a priori. The notion of stability
analysis is proposed to extract the number of clusters for the
underlying data using multiple cooperative swarms. The mathematical
explanations demonstrating why the proposed approach leads to more
stable and robust results than those of the single swarm clustering
are also provided.
Application of the proposed multiple cooperative swarms clustering
is considered for one of the most challenging problems in speech
recognition: phoneme recognition. The proposed approach is used to
decompose the recognition task into a number of subtasks or modules.
Each module involves a set of similar phonemes known as a phoneme
family. Basically, the goal is to obtain the best solution for
phoneme families using the proposed multiple cooperative swarms
clustering. The experiments using the standard TIMIT corpus indicate
that using the proposed clustering approach boosts the accuracy of
the modular approach for phoneme recognition considerably. |
author |
Ahmadi, Abbas |
author_facet |
Ahmadi, Abbas |
author_sort |
Ahmadi, Abbas |
title |
Multiple Cooperative Swarms for Data Clustering |
title_short |
Multiple Cooperative Swarms for Data Clustering |
title_full |
Multiple Cooperative Swarms for Data Clustering |
title_fullStr |
Multiple Cooperative Swarms for Data Clustering |
title_full_unstemmed |
Multiple Cooperative Swarms for Data Clustering |
title_sort |
multiple cooperative swarms for data clustering |
publishDate |
2008 |
url |
http://hdl.handle.net/10012/4031 |
work_keys_str_mv |
AT ahmadiabbas multiplecooperativeswarmsfordataclustering |
_version_ |
1716600075229069312 |