A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator

Blood type still leads to an assumption about its relation to some personality aspects. This study observes preprocessing methods for improving the classification accuracy of MBTI data to determine blood type. The training and testing data use 250 data from the MBTI questionnaire answers given by 25...

Full description

Bibliographic Details
Main Authors: Ahmad Taufiq Akbar, Rochmat Husaini, Bagus Muhammad Akbar, Shoffan Saifullah
Format: Article
Language:English
Published: Diponegoro University 2020-10-01
Series:Jurnal Teknologi dan Sistem Komputer
Subjects:
Online Access:https://jtsiskom.undip.ac.id/index.php/jtsiskom/article/view/13625
id doaj-26f630a7a2634550bcd82de0b5d91fc9
record_format Article
spelling doaj-26f630a7a2634550bcd82de0b5d91fc92021-10-02T12:28:17ZengDiponegoro UniversityJurnal Teknologi dan Sistem Komputer2338-04032020-10-018427628310.14710/jtsiskom.2020.1362512837A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicatorAhmad Taufiq Akbar0Rochmat Husaini1Bagus Muhammad Akbar2Shoffan Saifullah3Department of Informatics, Universitas Pembangunan Nasional Veteran Yogyakarta, IndonesiaDepartment of Informatics, Universitas Pembangunan Nasional Veteran Yogyakarta, IndonesiaDepartment of Informatics, Universitas Pembangunan Nasional Veteran Yogyakarta, IndonesiaDepartment of Informatics, Universitas Pembangunan Nasional Veteran Yogyakarta, IndonesiaBlood type still leads to an assumption about its relation to some personality aspects. This study observes preprocessing methods for improving the classification accuracy of MBTI data to determine blood type. The training and testing data use 250 data from the MBTI questionnaire answers given by 250 respondents. The classification uses the k-Nearest Neighbor (k-NN) algorithm. Without preprocessing, k-NN results in about 32 % accuracy, so it needs some preprocessing to handle data imbalance before the classification. The proposed preprocessing consists of two-stage, the first stage is the unsupervised resample, and the second is the supervised resample. For the validation, it uses ten cross-validations. The result of k-Nearest Neighbor classification after using these proposed preprocessing stages has finally increased the accuracy, F-score, and recall significantly.https://jtsiskom.undip.ac.id/index.php/jtsiskom/article/view/13625imbalance datablood typeresamplek-nearest neighbormbti
collection DOAJ
language English
format Article
sources DOAJ
author Ahmad Taufiq Akbar
Rochmat Husaini
Bagus Muhammad Akbar
Shoffan Saifullah
spellingShingle Ahmad Taufiq Akbar
Rochmat Husaini
Bagus Muhammad Akbar
Shoffan Saifullah
A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator
Jurnal Teknologi dan Sistem Komputer
imbalance data
blood type
resample
k-nearest neighbor
mbti
author_facet Ahmad Taufiq Akbar
Rochmat Husaini
Bagus Muhammad Akbar
Shoffan Saifullah
author_sort Ahmad Taufiq Akbar
title A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator
title_short A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator
title_full A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator
title_fullStr A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator
title_full_unstemmed A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator
title_sort proposed method for handling an imbalance data in classification of blood type based on myers-briggs type indicator
publisher Diponegoro University
series Jurnal Teknologi dan Sistem Komputer
issn 2338-0403
publishDate 2020-10-01
description Blood type still leads to an assumption about its relation to some personality aspects. This study observes preprocessing methods for improving the classification accuracy of MBTI data to determine blood type. The training and testing data use 250 data from the MBTI questionnaire answers given by 250 respondents. The classification uses the k-Nearest Neighbor (k-NN) algorithm. Without preprocessing, k-NN results in about 32 % accuracy, so it needs some preprocessing to handle data imbalance before the classification. The proposed preprocessing consists of two-stage, the first stage is the unsupervised resample, and the second is the supervised resample. For the validation, it uses ten cross-validations. The result of k-Nearest Neighbor classification after using these proposed preprocessing stages has finally increased the accuracy, F-score, and recall significantly.
topic imbalance data
blood type
resample
k-nearest neighbor
mbti
url https://jtsiskom.undip.ac.id/index.php/jtsiskom/article/view/13625
work_keys_str_mv AT ahmadtaufiqakbar aproposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator
AT rochmathusaini aproposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator
AT bagusmuhammadakbar aproposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator
AT shoffansaifullah aproposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator
AT ahmadtaufiqakbar proposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator
AT rochmathusaini proposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator
AT bagusmuhammadakbar proposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator
AT shoffansaifullah proposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator
_version_ 1716855686546063360