Prediction of hematocrit through imbalanced dataset of blood spectra

Abstract In spite of machine learning has been successfully used in a wide range of healthcare applications, there are several parameters that could influence the performance of a machine learning system. One of the big issues for a machine learning algorithm is related to imbalanced dataset. An imb...

Full description

Bibliographic Details
Main Authors: Cristoforo Decaro, Giovanni Battista Montanari, Marco Bianconi, Gaetano Bellanca
Format: Article
Language:English
Published: Wiley 2021-04-01
Series:Healthcare Technology Letters
Online Access:https://doi.org/10.1049/htl2.12006
id doaj-d0277ba1e6e748429d9f740609be6e76
record_format Article
spelling doaj-d0277ba1e6e748429d9f740609be6e762021-04-20T13:45:04ZengWileyHealthcare Technology Letters2053-37132021-04-0182374410.1049/htl2.12006Prediction of hematocrit through imbalanced dataset of blood spectraCristoforo Decaro0Giovanni Battista Montanari1Marco Bianconi2Gaetano Bellanca3Department of Engineering University of Ferrara Ferrara ItalyMISTER Smart Innovation Bologna 40129 ItalyCNR‐IMM‐UOS di Bologna and MISTER Smart Innovation Bologna ItalyDepartment of Engineering University of Ferrara Ferrara ItalyAbstract In spite of machine learning has been successfully used in a wide range of healthcare applications, there are several parameters that could influence the performance of a machine learning system. One of the big issues for a machine learning algorithm is related to imbalanced dataset. An imbalanced dataset occurs when the distribution of data is not uniform. This makes harder the implementation of accurate models. In this paper, intelligent models are implemented to predict the hematocrit level of blood starting from visible spectral data. The aim of this work is to show the effects of two balancing techniques (SMOTE and SMOTE+ENN) on the imbalanced dataset of blood spectra. Four different machine learning systems are fitted with imbalanced and balanced datasets and their performances are compared showing an improvement, in terms of accuracy, due to the use of balancing.https://doi.org/10.1049/htl2.12006
collection DOAJ
language English
format Article
sources DOAJ
author Cristoforo Decaro
Giovanni Battista Montanari
Marco Bianconi
Gaetano Bellanca
spellingShingle Cristoforo Decaro
Giovanni Battista Montanari
Marco Bianconi
Gaetano Bellanca
Prediction of hematocrit through imbalanced dataset of blood spectra
Healthcare Technology Letters
author_facet Cristoforo Decaro
Giovanni Battista Montanari
Marco Bianconi
Gaetano Bellanca
author_sort Cristoforo Decaro
title Prediction of hematocrit through imbalanced dataset of blood spectra
title_short Prediction of hematocrit through imbalanced dataset of blood spectra
title_full Prediction of hematocrit through imbalanced dataset of blood spectra
title_fullStr Prediction of hematocrit through imbalanced dataset of blood spectra
title_full_unstemmed Prediction of hematocrit through imbalanced dataset of blood spectra
title_sort prediction of hematocrit through imbalanced dataset of blood spectra
publisher Wiley
series Healthcare Technology Letters
issn 2053-3713
publishDate 2021-04-01
description Abstract In spite of machine learning has been successfully used in a wide range of healthcare applications, there are several parameters that could influence the performance of a machine learning system. One of the big issues for a machine learning algorithm is related to imbalanced dataset. An imbalanced dataset occurs when the distribution of data is not uniform. This makes harder the implementation of accurate models. In this paper, intelligent models are implemented to predict the hematocrit level of blood starting from visible spectral data. The aim of this work is to show the effects of two balancing techniques (SMOTE and SMOTE+ENN) on the imbalanced dataset of blood spectra. Four different machine learning systems are fitted with imbalanced and balanced datasets and their performances are compared showing an improvement, in terms of accuracy, due to the use of balancing.
url https://doi.org/10.1049/htl2.12006
work_keys_str_mv AT cristoforodecaro predictionofhematocritthroughimbalanceddatasetofbloodspectra
AT giovannibattistamontanari predictionofhematocritthroughimbalanceddatasetofbloodspectra
AT marcobianconi predictionofhematocritthroughimbalanceddatasetofbloodspectra
AT gaetanobellanca predictionofhematocritthroughimbalanceddatasetofbloodspectra
_version_ 1721517771473289216