IgA Nephropathy Prediction in Children with Machine Learning Algorithms
Immunoglobulin A nephropathy (IgAN) is the most common primary glomerular disease worldwide and a major cause of renal failure. IgAN prediction in children with machine learning algorithms has rarely been studied. We retrospectively analyzed the electronic medical records from the Nan...
Main Authors: | Ping Zhang, Rongqin Wang, Nianfeng Shi |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG, 2020-12-01 |
Series: | Future Internet |
Subjects: | IgA nephropathy; machine learning; XGBoost; prediction; decision-making system |
Online Access: | https://www.mdpi.com/1999-5903/12/12/230 |
id |
doaj-d60f9ec54f5846bda542ba86e2020e8f |
---|---|
record_format |
Article |
spelling |
doaj-d60f9ec54f5846bda542ba86e2020e8f; 2020-12-18T00:04:06Z; eng; MDPI AG; Future Internet; 1999-5903; 2020-12-01; 12; 230; 230; 10.3390/fi12120230; IgA Nephropathy Prediction in Children with Machine Learning Algorithms; Ping Zhang (0); Rongqin Wang (1); Nianfeng Shi (2); School of Mathematics and Statistics, Henan University of Science and Technology, Luoyang 471000, China; School of Information Engineering, Zhengzhou University, Zhengzhou 450000, China; School of Computer and Information Engineering, Luoyang Institute of Science and Technology, Luoyang 471000, China; Immunoglobulin A nephropathy (IgAN) is the most common primary glomerular disease worldwide and a major cause of renal failure. IgAN prediction in children with machine learning algorithms has rarely been studied. We retrospectively analyzed electronic medical records from the Nanjing Eastern War Zone Hospital and chose eXtreme Gradient Boosting (XGBoost), random forest (RF), CatBoost, support vector machine (SVM), k-nearest neighbor (KNN), and extreme learning machine (ELM) models to predict the probability that a patient would or would not reach end-stage renal disease (ESRD) within five years. We used the chi-square test to select the 16 most relevant features as model input and designed a decision-making system (DMS) for IgAN prediction in children based on XGBoost and the Django framework. The receiver operating characteristic (ROC) curve was used to evaluate the models, and XGBoost performed best. The AUC, accuracy, precision, recall, and F1-score of XGBoost were 85.11%, 78.60%, 75.96%, 76.70%, and 76.33%, respectively. The XGBoost model is useful for physicians and pediatric patients in providing predictions regarding IgAN. A DMS built on the XGBoost model can assist physicians in treating IgAN in children effectively and preventing deterioration.; https://www.mdpi.com/1999-5903/12/12/230; IgA nephropathy; machine learning; XGBoost; prediction; decision-making system |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ping Zhang Rongqin Wang Nianfeng Shi |
spellingShingle |
Ping Zhang Rongqin Wang Nianfeng Shi IgA Nephropathy Prediction in Children with Machine Learning Algorithms Future Internet IgA nephropathy machine learning XGBoost prediction decision-making system |
author_facet |
Ping Zhang Rongqin Wang Nianfeng Shi |
author_sort |
Ping Zhang |
title |
IgA Nephropathy Prediction in Children with Machine Learning Algorithms |
title_short |
IgA Nephropathy Prediction in Children with Machine Learning Algorithms |
title_full |
IgA Nephropathy Prediction in Children with Machine Learning Algorithms |
title_fullStr |
IgA Nephropathy Prediction in Children with Machine Learning Algorithms |
title_full_unstemmed |
IgA Nephropathy Prediction in Children with Machine Learning Algorithms |
title_sort |
iga nephropathy prediction in children with machine learning algorithms |
publisher |
MDPI AG |
series |
Future Internet |
issn |
1999-5903 |
publishDate |
2020-12-01 |
description |
Immunoglobulin A nephropathy (IgAN) is the most common primary glomerular disease worldwide and a major cause of renal failure. IgAN prediction in children with machine learning algorithms has rarely been studied. We retrospectively analyzed electronic medical records from the Nanjing Eastern War Zone Hospital and chose eXtreme Gradient Boosting (XGBoost), random forest (RF), CatBoost, support vector machine (SVM), k-nearest neighbor (KNN), and extreme learning machine (ELM) models to predict the probability that a patient would or would not reach end-stage renal disease (ESRD) within five years. We used the chi-square test to select the 16 most relevant features as model input and designed a decision-making system (DMS) for IgAN prediction in children based on XGBoost and the Django framework. The receiver operating characteristic (ROC) curve was used to evaluate the models, and XGBoost performed best. The AUC, accuracy, precision, recall, and F1-score of XGBoost were 85.11%, 78.60%, 75.96%, 76.70%, and 76.33%, respectively. The XGBoost model is useful for physicians and pediatric patients in providing predictions regarding IgAN. A DMS built on the XGBoost model can assist physicians in treating IgAN in children effectively and preventing deterioration. |
topic |
IgA nephropathy machine learning XGBoost prediction decision-making system |
url |
https://www.mdpi.com/1999-5903/12/12/230 |
work_keys_str_mv |
AT pingzhang iganephropathypredictioninchildrenwithmachinelearningalgorithms AT rongqinwang iganephropathypredictioninchildrenwithmachinelearningalgorithms AT nianfengshi iganephropathypredictioninchildrenwithmachinelearningalgorithms |
_version_ |
1724378925825523712 |
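
The description field above reports precision, recall, and F1-score for the XGBoost model. As a minimal sketch (not the authors' code), the following Python checks that the reported F1-score is consistent with the harmonic mean of the reported precision and recall, and shows how accuracy, precision, recall, and F1 are usually derived from a binary confusion matrix. The function names and the confusion-matrix counts are illustrative assumptions; the record does not give the underlying counts.

```python
def f1_from_precision_recall(precision: float, recall: float) -> float:
    """Return F1 as the harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def classification_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Compute accuracy, precision, recall, and F1 from confusion-matrix counts."""
    total = tp + fp + fn + tn
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        "precision": precision,
        "recall": recall,
        "f1": f1_from_precision_recall(precision, recall),
    }


# Precision and recall as reported in the record (75.96% and 76.70%).
reported_precision = 0.7596
reported_recall = 0.7670
f1_percent = round(100 * f1_from_precision_recall(reported_precision, reported_recall), 2)
print(f"F1 implied by reported precision and recall: {f1_percent}%")

# Hypothetical counts, chosen only to illustrate the formulas.
example = classification_metrics(tp=8, fp=2, fn=2, tn=8)
print(example)
```

The implied F1 rounds to 76.33%, which agrees with the value stated in the description. AUC is not covered here because it depends on the ranked prediction scores, which the record does not include.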