IgA Nephropathy Prediction in Children with Machine Learning Algorithms

Immunoglobulin A nephropathy (IgAN) is the most common primary glomerular disease all over the world and it is a major cause of renal failure. IgAN prediction in children with machine learning algorithms has been rarely studied. We retrospectively analyzed the electronic medical records from the Nan...

Full description

Bibliographic Details
Main Authors: Ping Zhang, Rongqin Wang, Nianfeng Shi
Format: Article
Language:English
Published: MDPI AG 2020-12-01
Series:Future Internet
Subjects:
Online Access:https://www.mdpi.com/1999-5903/12/12/230
id doaj-d60f9ec54f5846bda542ba86e2020e8f
record_format Article
spelling doaj-d60f9ec54f5846bda542ba86e2020e8f2020-12-18T00:04:06ZengMDPI AGFuture Internet1999-59032020-12-011223023010.3390/fi12120230IgA Nephropathy Prediction in Children with Machine Learning AlgorithmsPing Zhang0Rongqin Wang1Nianfeng Shi2School of Mathematics and Statistics, Henan University of Science and Technology, Luoyang 471000, ChinaSchool of Information Engineering, Zhengzhou University, Zhengzhou 450000, ChinaSchool of Computer and Information Engineering, Luoyang Institute of Science and Technology, Luoyang 471000, ChinaImmunoglobulin A nephropathy (IgAN) is the most common primary glomerular disease all over the world and it is a major cause of renal failure. IgAN prediction in children with machine learning algorithms has been rarely studied. We retrospectively analyzed the electronic medical records from the Nanjing Eastern War Zone Hospital, chose eXtreme Gradient Boosting (XGBoost), random forest (RF), CatBoost, support vector machines (SVM), k-nearest neighbor (KNN), and extreme learning machine (ELM) models in order to predict the probability that the patient would not reach or reach end-stage renal disease (ESRD) within five years, used the chi-square test to select the most relevant 16 features as the input of the model, and designed a decision-making system (DMS) of IgAN prediction in children that is based on XGBoost and Django framework. The receiver operating characteristic (ROC) curve was used in order to evaluate the performance of the models and XGBoost had the best performance by comparison. The AUC value, accuracy, precision, recall, and f1-score of XGBoost were 85.11%, 78.60%, 75.96%, 76.70%, and 76.33%, respectively. The XGBoost model is useful for physicians and pediatric patients in providing predictions regarding IgAN. As an advantage, a DMS can be designed based on the XGBoost model to assist a physician to effectively treat IgAN in children for preventing deterioration.https://www.mdpi.com/1999-5903/12/12/230IgA nephropathymachine learningXGBoostpredictiondecision-making system
collection DOAJ
language English
format Article
sources DOAJ
author Ping Zhang
Rongqin Wang
Nianfeng Shi
spellingShingle Ping Zhang
Rongqin Wang
Nianfeng Shi
IgA Nephropathy Prediction in Children with Machine Learning Algorithms
Future Internet
IgA nephropathy
machine learning
XGBoost
prediction
decision-making system
author_facet Ping Zhang
Rongqin Wang
Nianfeng Shi
author_sort Ping Zhang
title IgA Nephropathy Prediction in Children with Machine Learning Algorithms
title_short IgA Nephropathy Prediction in Children with Machine Learning Algorithms
title_full IgA Nephropathy Prediction in Children with Machine Learning Algorithms
title_fullStr IgA Nephropathy Prediction in Children with Machine Learning Algorithms
title_full_unstemmed IgA Nephropathy Prediction in Children with Machine Learning Algorithms
title_sort iga nephropathy prediction in children with machine learning algorithms
publisher MDPI AG
series Future Internet
issn 1999-5903
publishDate 2020-12-01
description Immunoglobulin A nephropathy (IgAN) is the most common primary glomerular disease all over the world and it is a major cause of renal failure. IgAN prediction in children with machine learning algorithms has been rarely studied. We retrospectively analyzed the electronic medical records from the Nanjing Eastern War Zone Hospital, chose eXtreme Gradient Boosting (XGBoost), random forest (RF), CatBoost, support vector machines (SVM), k-nearest neighbor (KNN), and extreme learning machine (ELM) models in order to predict the probability that the patient would not reach or reach end-stage renal disease (ESRD) within five years, used the chi-square test to select the most relevant 16 features as the input of the model, and designed a decision-making system (DMS) of IgAN prediction in children that is based on XGBoost and Django framework. The receiver operating characteristic (ROC) curve was used in order to evaluate the performance of the models and XGBoost had the best performance by comparison. The AUC value, accuracy, precision, recall, and f1-score of XGBoost were 85.11%, 78.60%, 75.96%, 76.70%, and 76.33%, respectively. The XGBoost model is useful for physicians and pediatric patients in providing predictions regarding IgAN. As an advantage, a DMS can be designed based on the XGBoost model to assist a physician to effectively treat IgAN in children for preventing deterioration.
topic IgA nephropathy
machine learning
XGBoost
prediction
decision-making system
url https://www.mdpi.com/1999-5903/12/12/230
work_keys_str_mv AT pingzhang iganephropathypredictioninchildrenwithmachinelearningalgorithms
AT rongqinwang iganephropathypredictioninchildrenwithmachinelearningalgorithms
AT nianfengshi iganephropathypredictioninchildrenwithmachinelearningalgorithms
_version_ 1724378925825523712