Granular Description of Data: A Comparative Study Regarding to Different Distance Measures

In data science, how to depict data in the concise and comprehensive way is an important issue. To address the issue, the key is to construct descriptors that are highly interpretable and can be used to reveal the data structure. Information granules, as one important role in the field of granular c...

Full description

Bibliographic Details
Main Authors: Fanzhong Meng, Chen Fu, Zhentang Shi, Wei Lu
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9141267/
id doaj-05fbd776209f45ffbde8a6a7e92680cf
record_format Article
spelling doaj-05fbd776209f45ffbde8a6a7e92680cf2021-03-30T04:49:02ZengIEEEIEEE Access2169-35362020-01-01813047613048510.1109/ACCESS.2020.30095419141267Granular Description of Data: A Comparative Study Regarding to Different Distance MeasuresFanzhong Meng0Chen Fu1https://orcid.org/0000-0003-3908-3184Zhentang Shi2Wei Lu3https://orcid.org/0000-0002-5775-1222Oilfield Exploration & Production Department, SINOPEC Corp., Beijing, ChinaSchool of Control Science and Engineering, Dalian University of Technology, Dalian, ChinaSINOPEC Dalian Research Institute of Petroleum and Petrochemicals, Dalian, ChinaSchool of Control Science and Engineering, Dalian University of Technology, Dalian, ChinaIn data science, how to depict data in the concise and comprehensive way is an important issue. To address the issue, the key is to construct descriptors that are highly interpretable and can be used to reveal the data structure. Information granules, as one important role in the field of granular computing, are entities that can be easily represented and abstracted from data. Therefore, by constructing a series of information granules, the characteristics of data can be captured and described, and the granular description of data is realized. A key part of the granular description of data is to explore the geometric characteristics (locations and shapes) of information granules used to describe data. Since distance measures directly affect the geometric characteristics of the constructed information granules, a comparative study based on three different distance measures is conducted in this paper. From the experimental results based on both synthetic and UCI repository datasets, it can be seen that the information granules constructed in the case where three different distance measures are used show different geometrical shapes, and can describe the data in a concise way. Furthermore, the data structure can be explored more comprehensively by using three distance measures.https://ieeexplore.ieee.org/document/9141267/Data descriptiongranular computinghypersphere information granulesdistance measures
collection DOAJ
language English
format Article
sources DOAJ
author Fanzhong Meng
Chen Fu
Zhentang Shi
Wei Lu
spellingShingle Fanzhong Meng
Chen Fu
Zhentang Shi
Wei Lu
Granular Description of Data: A Comparative Study Regarding to Different Distance Measures
IEEE Access
Data description
granular computing
hypersphere information granules
distance measures
author_facet Fanzhong Meng
Chen Fu
Zhentang Shi
Wei Lu
author_sort Fanzhong Meng
title Granular Description of Data: A Comparative Study Regarding to Different Distance Measures
title_short Granular Description of Data: A Comparative Study Regarding to Different Distance Measures
title_full Granular Description of Data: A Comparative Study Regarding to Different Distance Measures
title_fullStr Granular Description of Data: A Comparative Study Regarding to Different Distance Measures
title_full_unstemmed Granular Description of Data: A Comparative Study Regarding to Different Distance Measures
title_sort granular description of data: a comparative study regarding to different distance measures
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description In data science, how to depict data in the concise and comprehensive way is an important issue. To address the issue, the key is to construct descriptors that are highly interpretable and can be used to reveal the data structure. Information granules, as one important role in the field of granular computing, are entities that can be easily represented and abstracted from data. Therefore, by constructing a series of information granules, the characteristics of data can be captured and described, and the granular description of data is realized. A key part of the granular description of data is to explore the geometric characteristics (locations and shapes) of information granules used to describe data. Since distance measures directly affect the geometric characteristics of the constructed information granules, a comparative study based on three different distance measures is conducted in this paper. From the experimental results based on both synthetic and UCI repository datasets, it can be seen that the information granules constructed in the case where three different distance measures are used show different geometrical shapes, and can describe the data in a concise way. Furthermore, the data structure can be explored more comprehensively by using three distance measures.
topic Data description
granular computing
hypersphere information granules
distance measures
url https://ieeexplore.ieee.org/document/9141267/
work_keys_str_mv AT fanzhongmeng granulardescriptionofdataacomparativestudyregardingtodifferentdistancemeasures
AT chenfu granulardescriptionofdataacomparativestudyregardingtodifferentdistancemeasures
AT zhentangshi granulardescriptionofdataacomparativestudyregardingtodifferentdistancemeasures
AT weilu granulardescriptionofdataacomparativestudyregardingtodifferentdistancemeasures
_version_ 1724181178015023104