Granular Description of Data: A Comparative Study Regarding to Different Distance Measures
In data science, how to depict data in the concise and comprehensive way is an important issue. To address the issue, the key is to construct descriptors that are highly interpretable and can be used to reveal the data structure. Information granules, as one important role in the field of granular c...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9141267/ |
id |
doaj-05fbd776209f45ffbde8a6a7e92680cf |
---|---|
record_format |
Article |
spelling |
doaj-05fbd776209f45ffbde8a6a7e92680cf2021-03-30T04:49:02ZengIEEEIEEE Access2169-35362020-01-01813047613048510.1109/ACCESS.2020.30095419141267Granular Description of Data: A Comparative Study Regarding to Different Distance MeasuresFanzhong Meng0Chen Fu1https://orcid.org/0000-0003-3908-3184Zhentang Shi2Wei Lu3https://orcid.org/0000-0002-5775-1222Oilfield Exploration & Production Department, SINOPEC Corp., Beijing, ChinaSchool of Control Science and Engineering, Dalian University of Technology, Dalian, ChinaSINOPEC Dalian Research Institute of Petroleum and Petrochemicals, Dalian, ChinaSchool of Control Science and Engineering, Dalian University of Technology, Dalian, ChinaIn data science, how to depict data in the concise and comprehensive way is an important issue. To address the issue, the key is to construct descriptors that are highly interpretable and can be used to reveal the data structure. Information granules, as one important role in the field of granular computing, are entities that can be easily represented and abstracted from data. Therefore, by constructing a series of information granules, the characteristics of data can be captured and described, and the granular description of data is realized. A key part of the granular description of data is to explore the geometric characteristics (locations and shapes) of information granules used to describe data. Since distance measures directly affect the geometric characteristics of the constructed information granules, a comparative study based on three different distance measures is conducted in this paper. From the experimental results based on both synthetic and UCI repository datasets, it can be seen that the information granules constructed in the case where three different distance measures are used show different geometrical shapes, and can describe the data in a concise way. Furthermore, the data structure can be explored more comprehensively by using three distance measures.https://ieeexplore.ieee.org/document/9141267/Data descriptiongranular computinghypersphere information granulesdistance measures |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Fanzhong Meng Chen Fu Zhentang Shi Wei Lu |
spellingShingle |
Fanzhong Meng Chen Fu Zhentang Shi Wei Lu Granular Description of Data: A Comparative Study Regarding to Different Distance Measures IEEE Access Data description granular computing hypersphere information granules distance measures |
author_facet |
Fanzhong Meng Chen Fu Zhentang Shi Wei Lu |
author_sort |
Fanzhong Meng |
title |
Granular Description of Data: A Comparative Study Regarding to Different Distance Measures |
title_short |
Granular Description of Data: A Comparative Study Regarding to Different Distance Measures |
title_full |
Granular Description of Data: A Comparative Study Regarding to Different Distance Measures |
title_fullStr |
Granular Description of Data: A Comparative Study Regarding to Different Distance Measures |
title_full_unstemmed |
Granular Description of Data: A Comparative Study Regarding to Different Distance Measures |
title_sort |
granular description of data: a comparative study regarding to different distance measures |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
In data science, how to depict data in the concise and comprehensive way is an important issue. To address the issue, the key is to construct descriptors that are highly interpretable and can be used to reveal the data structure. Information granules, as one important role in the field of granular computing, are entities that can be easily represented and abstracted from data. Therefore, by constructing a series of information granules, the characteristics of data can be captured and described, and the granular description of data is realized. A key part of the granular description of data is to explore the geometric characteristics (locations and shapes) of information granules used to describe data. Since distance measures directly affect the geometric characteristics of the constructed information granules, a comparative study based on three different distance measures is conducted in this paper. From the experimental results based on both synthetic and UCI repository datasets, it can be seen that the information granules constructed in the case where three different distance measures are used show different geometrical shapes, and can describe the data in a concise way. Furthermore, the data structure can be explored more comprehensively by using three distance measures. |
topic |
Data description granular computing hypersphere information granules distance measures |
url |
https://ieeexplore.ieee.org/document/9141267/ |
work_keys_str_mv |
AT fanzhongmeng granulardescriptionofdataacomparativestudyregardingtodifferentdistancemeasures AT chenfu granulardescriptionofdataacomparativestudyregardingtodifferentdistancemeasures AT zhentangshi granulardescriptionofdataacomparativestudyregardingtodifferentdistancemeasures AT weilu granulardescriptionofdataacomparativestudyregardingtodifferentdistancemeasures |
_version_ |
1724181178015023104 |