Individual Attribute Selection Using Information Gain Based Distance for Group Classification of Elderly People With Hypertension

Attribute selection is the process of selecting relevant attributes being used in model construction to enhance model accuracy. For general medical oriented classification applications, classical attribute selection methods principally select common attributes in the dataset for all individuals. The...

Full description

Bibliographic Details
Main Authors: Supansa Chaising, Punnarumol Temdee, Ramjee Prasad
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9443197/
id doaj-747bbecbbb114199abaf3a73ed56d95a
record_format Article
spelling doaj-747bbecbbb114199abaf3a73ed56d95a2021-06-14T23:01:05ZengIEEEIEEE Access2169-35362021-01-019827138272510.1109/ACCESS.2021.30846239443197Individual Attribute Selection Using Information Gain Based Distance for Group Classification of Elderly People With HypertensionSupansa Chaising0https://orcid.org/0000-0003-1418-2215Punnarumol Temdee1https://orcid.org/0000-0001-9847-157XRamjee Prasad2Computer and Communication Engineering for Capacity Building Research Center, School of Information Technology, Mae Fah Luang University, Chiang Rai, ThailandComputer and Communication Engineering for Capacity Building Research Center, School of Information Technology, Mae Fah Luang University, Chiang Rai, ThailandDepartment of Business Development and Technology, CTIF Global Capsule, Aarhus University, Herning, DenmarkAttribute selection is the process of selecting relevant attributes being used in model construction to enhance model accuracy. For general medical oriented classification applications, classical attribute selection methods principally select common attributes in the dataset for all individuals. The idea of using individual attributes is proposed in this study to represent the difference among individuals for self-diagnosis. Consequently, this study proposes a new attribute selection method, called information gain based distance (IGD), for individual attribute selection, which represents an individual’s health condition differently and can be used for effective classification. The proposed method combines the concept of information gain and objective distance to select individual attributes affecting classification. The IGD method is expected to provide higher classification performance than classical attribute selection methods. To assess the performance of the IGD method, classification accuracy between data with classical attribute selections and with the IGD method is compared. The case study is conducted with 971 secondary data used for group classification of elderly people with hypertension. The classification result of different classifiers was compared, including K-nearest neighbors, neural network, and naive Bayes. The comparison revealed that the classification of data with the IGD attribute selection method provided an average classification accuracy of 98.73%. In comparison, those classifications of data with classical attribute selection methods provided 62.99%, 62.99%, 62.62%, and 62.85% for information gain, Gini index, chi-squared, and decision tree, respectively. The results showed that data classification with the IGD method provided higher performance than those with the classical attribute selection methods.https://ieeexplore.ieee.org/document/9443197/Attribute selectionindividual attributesinformation gainhypertensionelderly people
collection DOAJ
language English
format Article
sources DOAJ
author Supansa Chaising
Punnarumol Temdee
Ramjee Prasad
spellingShingle Supansa Chaising
Punnarumol Temdee
Ramjee Prasad
Individual Attribute Selection Using Information Gain Based Distance for Group Classification of Elderly People With Hypertension
IEEE Access
Attribute selection
individual attributes
information gain
hypertension
elderly people
author_facet Supansa Chaising
Punnarumol Temdee
Ramjee Prasad
author_sort Supansa Chaising
title Individual Attribute Selection Using Information Gain Based Distance for Group Classification of Elderly People With Hypertension
title_short Individual Attribute Selection Using Information Gain Based Distance for Group Classification of Elderly People With Hypertension
title_full Individual Attribute Selection Using Information Gain Based Distance for Group Classification of Elderly People With Hypertension
title_fullStr Individual Attribute Selection Using Information Gain Based Distance for Group Classification of Elderly People With Hypertension
title_full_unstemmed Individual Attribute Selection Using Information Gain Based Distance for Group Classification of Elderly People With Hypertension
title_sort individual attribute selection using information gain based distance for group classification of elderly people with hypertension
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description Attribute selection is the process of selecting relevant attributes being used in model construction to enhance model accuracy. For general medical oriented classification applications, classical attribute selection methods principally select common attributes in the dataset for all individuals. The idea of using individual attributes is proposed in this study to represent the difference among individuals for self-diagnosis. Consequently, this study proposes a new attribute selection method, called information gain based distance (IGD), for individual attribute selection, which represents an individual’s health condition differently and can be used for effective classification. The proposed method combines the concept of information gain and objective distance to select individual attributes affecting classification. The IGD method is expected to provide higher classification performance than classical attribute selection methods. To assess the performance of the IGD method, classification accuracy between data with classical attribute selections and with the IGD method is compared. The case study is conducted with 971 secondary data used for group classification of elderly people with hypertension. The classification result of different classifiers was compared, including K-nearest neighbors, neural network, and naive Bayes. The comparison revealed that the classification of data with the IGD attribute selection method provided an average classification accuracy of 98.73%. In comparison, those classifications of data with classical attribute selection methods provided 62.99%, 62.99%, 62.62%, and 62.85% for information gain, Gini index, chi-squared, and decision tree, respectively. The results showed that data classification with the IGD method provided higher performance than those with the classical attribute selection methods.
topic Attribute selection
individual attributes
information gain
hypertension
elderly people
url https://ieeexplore.ieee.org/document/9443197/
work_keys_str_mv AT supansachaising individualattributeselectionusinginformationgainbaseddistanceforgroupclassificationofelderlypeoplewithhypertension
AT punnarumoltemdee individualattributeselectionusinginformationgainbaseddistanceforgroupclassificationofelderlypeoplewithhypertension
AT ramjeeprasad individualattributeselectionusinginformationgainbaseddistanceforgroupclassificationofelderlypeoplewithhypertension
_version_ 1721377816344264704