A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator
Blood type still leads to an assumption about its relation to some personality aspects. This study observes preprocessing methods for improving the classification accuracy of MBTI data to determine blood type. The training and testing data use 250 data from the MBTI questionnaire answers given by 25...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Diponegoro University
2020-10-01
|
Series: | Jurnal Teknologi dan Sistem Komputer |
Subjects: | |
Online Access: | https://jtsiskom.undip.ac.id/index.php/jtsiskom/article/view/13625 |
id |
doaj-26f630a7a2634550bcd82de0b5d91fc9 |
---|---|
record_format |
Article |
spelling |
doaj-26f630a7a2634550bcd82de0b5d91fc92021-10-02T12:28:17ZengDiponegoro UniversityJurnal Teknologi dan Sistem Komputer2338-04032020-10-018427628310.14710/jtsiskom.2020.1362512837A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicatorAhmad Taufiq Akbar0Rochmat Husaini1Bagus Muhammad Akbar2Shoffan Saifullah3Department of Informatics, Universitas Pembangunan Nasional Veteran Yogyakarta, IndonesiaDepartment of Informatics, Universitas Pembangunan Nasional Veteran Yogyakarta, IndonesiaDepartment of Informatics, Universitas Pembangunan Nasional Veteran Yogyakarta, IndonesiaDepartment of Informatics, Universitas Pembangunan Nasional Veteran Yogyakarta, IndonesiaBlood type still leads to an assumption about its relation to some personality aspects. This study observes preprocessing methods for improving the classification accuracy of MBTI data to determine blood type. The training and testing data use 250 data from the MBTI questionnaire answers given by 250 respondents. The classification uses the k-Nearest Neighbor (k-NN) algorithm. Without preprocessing, k-NN results in about 32 % accuracy, so it needs some preprocessing to handle data imbalance before the classification. The proposed preprocessing consists of two-stage, the first stage is the unsupervised resample, and the second is the supervised resample. For the validation, it uses ten cross-validations. The result of k-Nearest Neighbor classification after using these proposed preprocessing stages has finally increased the accuracy, F-score, and recall significantly.https://jtsiskom.undip.ac.id/index.php/jtsiskom/article/view/13625imbalance datablood typeresamplek-nearest neighbormbti |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ahmad Taufiq Akbar Rochmat Husaini Bagus Muhammad Akbar Shoffan Saifullah |
spellingShingle |
Ahmad Taufiq Akbar Rochmat Husaini Bagus Muhammad Akbar Shoffan Saifullah A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator Jurnal Teknologi dan Sistem Komputer imbalance data blood type resample k-nearest neighbor mbti |
author_facet |
Ahmad Taufiq Akbar Rochmat Husaini Bagus Muhammad Akbar Shoffan Saifullah |
author_sort |
Ahmad Taufiq Akbar |
title |
A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator |
title_short |
A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator |
title_full |
A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator |
title_fullStr |
A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator |
title_full_unstemmed |
A proposed method for handling an imbalance data in classification of blood type based on Myers-Briggs type indicator |
title_sort |
proposed method for handling an imbalance data in classification of blood type based on myers-briggs type indicator |
publisher |
Diponegoro University |
series |
Jurnal Teknologi dan Sistem Komputer |
issn |
2338-0403 |
publishDate |
2020-10-01 |
description |
Blood type still leads to an assumption about its relation to some personality aspects. This study observes preprocessing methods for improving the classification accuracy of MBTI data to determine blood type. The training and testing data use 250 data from the MBTI questionnaire answers given by 250 respondents. The classification uses the k-Nearest Neighbor (k-NN) algorithm. Without preprocessing, k-NN results in about 32 % accuracy, so it needs some preprocessing to handle data imbalance before the classification. The proposed preprocessing consists of two-stage, the first stage is the unsupervised resample, and the second is the supervised resample. For the validation, it uses ten cross-validations. The result of k-Nearest Neighbor classification after using these proposed preprocessing stages has finally increased the accuracy, F-score, and recall significantly. |
topic |
imbalance data blood type resample k-nearest neighbor mbti |
url |
https://jtsiskom.undip.ac.id/index.php/jtsiskom/article/view/13625 |
work_keys_str_mv |
AT ahmadtaufiqakbar aproposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator AT rochmathusaini aproposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator AT bagusmuhammadakbar aproposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator AT shoffansaifullah aproposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator AT ahmadtaufiqakbar proposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator AT rochmathusaini proposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator AT bagusmuhammadakbar proposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator AT shoffansaifullah proposedmethodforhandlinganimbalancedatainclassificationofbloodtypebasedonmyersbriggstypeindicator |
_version_ |
1716855686546063360 |