Prediction of hematocrit through imbalanced dataset of blood spectra
Abstract In spite of machine learning has been successfully used in a wide range of healthcare applications, there are several parameters that could influence the performance of a machine learning system. One of the big issues for a machine learning algorithm is related to imbalanced dataset. An imb...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2021-04-01
|
Series: | Healthcare Technology Letters |
Online Access: | https://doi.org/10.1049/htl2.12006 |
id |
doaj-d0277ba1e6e748429d9f740609be6e76 |
---|---|
record_format |
Article |
spelling |
doaj-d0277ba1e6e748429d9f740609be6e762021-04-20T13:45:04ZengWileyHealthcare Technology Letters2053-37132021-04-0182374410.1049/htl2.12006Prediction of hematocrit through imbalanced dataset of blood spectraCristoforo Decaro0Giovanni Battista Montanari1Marco Bianconi2Gaetano Bellanca3Department of Engineering University of Ferrara Ferrara ItalyMISTER Smart Innovation Bologna 40129 ItalyCNR‐IMM‐UOS di Bologna and MISTER Smart Innovation Bologna ItalyDepartment of Engineering University of Ferrara Ferrara ItalyAbstract In spite of machine learning has been successfully used in a wide range of healthcare applications, there are several parameters that could influence the performance of a machine learning system. One of the big issues for a machine learning algorithm is related to imbalanced dataset. An imbalanced dataset occurs when the distribution of data is not uniform. This makes harder the implementation of accurate models. In this paper, intelligent models are implemented to predict the hematocrit level of blood starting from visible spectral data. The aim of this work is to show the effects of two balancing techniques (SMOTE and SMOTE+ENN) on the imbalanced dataset of blood spectra. Four different machine learning systems are fitted with imbalanced and balanced datasets and their performances are compared showing an improvement, in terms of accuracy, due to the use of balancing.https://doi.org/10.1049/htl2.12006 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Cristoforo Decaro Giovanni Battista Montanari Marco Bianconi Gaetano Bellanca |
spellingShingle |
Cristoforo Decaro Giovanni Battista Montanari Marco Bianconi Gaetano Bellanca Prediction of hematocrit through imbalanced dataset of blood spectra Healthcare Technology Letters |
author_facet |
Cristoforo Decaro Giovanni Battista Montanari Marco Bianconi Gaetano Bellanca |
author_sort |
Cristoforo Decaro |
title |
Prediction of hematocrit through imbalanced dataset of blood spectra |
title_short |
Prediction of hematocrit through imbalanced dataset of blood spectra |
title_full |
Prediction of hematocrit through imbalanced dataset of blood spectra |
title_fullStr |
Prediction of hematocrit through imbalanced dataset of blood spectra |
title_full_unstemmed |
Prediction of hematocrit through imbalanced dataset of blood spectra |
title_sort |
prediction of hematocrit through imbalanced dataset of blood spectra |
publisher |
Wiley |
series |
Healthcare Technology Letters |
issn |
2053-3713 |
publishDate |
2021-04-01 |
description |
Abstract In spite of machine learning has been successfully used in a wide range of healthcare applications, there are several parameters that could influence the performance of a machine learning system. One of the big issues for a machine learning algorithm is related to imbalanced dataset. An imbalanced dataset occurs when the distribution of data is not uniform. This makes harder the implementation of accurate models. In this paper, intelligent models are implemented to predict the hematocrit level of blood starting from visible spectral data. The aim of this work is to show the effects of two balancing techniques (SMOTE and SMOTE+ENN) on the imbalanced dataset of blood spectra. Four different machine learning systems are fitted with imbalanced and balanced datasets and their performances are compared showing an improvement, in terms of accuracy, due to the use of balancing. |
url |
https://doi.org/10.1049/htl2.12006 |
work_keys_str_mv |
AT cristoforodecaro predictionofhematocritthroughimbalanceddatasetofbloodspectra AT giovannibattistamontanari predictionofhematocritthroughimbalanceddatasetofbloodspectra AT marcobianconi predictionofhematocritthroughimbalanceddatasetofbloodspectra AT gaetanobellanca predictionofhematocritthroughimbalanceddatasetofbloodspectra |
_version_ |
1721517771473289216 |