Linguistically Defined Clustering of Data

This paper introduces a method of data clustering that is based on linguistically specified rules, similar to those applied by a human visually fulfilling a task. The method endeavors to follow these remarkable capabilities of intelligent beings. Even for most complicated data patterns a human is ca...

Full description

Bibliographic Details
Main Authors:	Leski Jacek M., Kotas Marian P.
Format:	Article
Language:	English
Published:	Sciendo 2018-09-01
Series:	International Journal of Applied Mathematics and Computer Science
Subjects:	clustering possibility theory linguistic rules data analysis
Online Access:	https://doi.org/10.2478/amcs-2018-0042

id	doaj-194849618f7d42a48bd8a26942d5cc9c
record_format	Article
spelling	doaj-194849618f7d42a48bd8a26942d5cc9c2021-09-06T19:41:09ZengSciendoInternational Journal of Applied Mathematics and Computer Science2083-84922018-09-0128354555710.2478/amcs-2018-0042amcs-2018-0042Linguistically Defined Clustering of DataLeski Jacek M.0Kotas Marian P.1Institute of Medical Technology & Equipment ITAM, Roosevelta 118,Zabrze, PolandInstitute of Electronics Silesian University of Technology, Akademicka 16,Gliwice, PolandThis paper introduces a method of data clustering that is based on linguistically specified rules, similar to those applied by a human visually fulfilling a task. The method endeavors to follow these remarkable capabilities of intelligent beings. Even for most complicated data patterns a human is capable of accomplishing the clustering process using relatively simple rules. His/her way of clustering is a sequential search for new structures in the data and new prototypes with the use of the following linguistic rule: search for prototypes in regions of extremely high data densities and immensely far from the previously found ones. Then, after this search has been completed, the respective data have to be assigned to any of the clusters whose nuclei (prototypes) have been found. A human again uses a simple linguistic rule: data from regions with similar densities, which are located exceedingly close to each other, should belong to the same cluster. The goal of this work is to prove experimentally that such simple linguistic rules can result in a clustering method that is competitive with the most effective methods known from the literature on the subject. A linguistic formulation of a validity index for determination of the number of clusters is also presented. Finally, an extensive experimental analysis of benchmark datasets is performed to demonstrate the validity of the clustering approach introduced. Its competitiveness with the state-of-the-art solutions is also shown.https://doi.org/10.2478/amcs-2018-0042clusteringpossibility theorylinguistic rulesdata analysis
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Leski Jacek M. Kotas Marian P.
spellingShingle	Leski Jacek M. Kotas Marian P. Linguistically Defined Clustering of Data International Journal of Applied Mathematics and Computer Science clustering possibility theory linguistic rules data analysis
author_facet	Leski Jacek M. Kotas Marian P.
author_sort	Leski Jacek M.
title	Linguistically Defined Clustering of Data
title_short	Linguistically Defined Clustering of Data
title_full	Linguistically Defined Clustering of Data
title_fullStr	Linguistically Defined Clustering of Data
title_full_unstemmed	Linguistically Defined Clustering of Data
title_sort	linguistically defined clustering of data
publisher	Sciendo
series	International Journal of Applied Mathematics and Computer Science
issn	2083-8492
publishDate	2018-09-01
description	This paper introduces a method of data clustering that is based on linguistically specified rules, similar to those applied by a human visually fulfilling a task. The method endeavors to follow these remarkable capabilities of intelligent beings. Even for most complicated data patterns a human is capable of accomplishing the clustering process using relatively simple rules. His/her way of clustering is a sequential search for new structures in the data and new prototypes with the use of the following linguistic rule: search for prototypes in regions of extremely high data densities and immensely far from the previously found ones. Then, after this search has been completed, the respective data have to be assigned to any of the clusters whose nuclei (prototypes) have been found. A human again uses a simple linguistic rule: data from regions with similar densities, which are located exceedingly close to each other, should belong to the same cluster. The goal of this work is to prove experimentally that such simple linguistic rules can result in a clustering method that is competitive with the most effective methods known from the literature on the subject. A linguistic formulation of a validity index for determination of the number of clusters is also presented. Finally, an extensive experimental analysis of benchmark datasets is performed to demonstrate the validity of the clustering approach introduced. Its competitiveness with the state-of-the-art solutions is also shown.
topic	clustering possibility theory linguistic rules data analysis
url	https://doi.org/10.2478/amcs-2018-0042
work_keys_str_mv	AT leskijacekm linguisticallydefinedclusteringofdata AT kotasmarianp linguisticallydefinedclusteringofdata
_version_	1717766962085363712

Linguistically Defined Clustering of Data

Similar Items