Explainable Prediction of Chronic Renal Disease in the Colombian Population Using Neural Networks and Case-Based Reasoning

This paper presents a neural network-based classifier to predict whether a person is at risk of developing chronic kidney disease (CKD). The model is trained with the demographic data and medical care information of two population groups: on the one hand, people diagnosed with CKD in Colombia during...

Full description

Bibliographic Details
Main Authors: Gabriel R. Vasquez-Morales, Sergio M. Martinez-Monterrubio, Pablo Moreno-Ger, Juan A. Recio-Garcia
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8877828/
id doaj-e7c2cd7329ee4b1bad131d8241fa27a7
record_format Article
spelling doaj-e7c2cd7329ee4b1bad131d8241fa27a72021-03-29T23:03:42ZengIEEEIEEE Access2169-35362019-01-01715290015291010.1109/ACCESS.2019.29484308877828Explainable Prediction of Chronic Renal Disease in the Colombian Population Using Neural Networks and Case-Based ReasoningGabriel R. Vasquez-Morales0https://orcid.org/0000-0001-8731-6195Sergio M. Martinez-Monterrubio1Pablo Moreno-Ger2Juan A. Recio-Garcia3Office of Information and Communications Technology, Ministry of Health and Social Protection, Bogotá, ColombiaDepartment of Software Engineering and Artificial Intelligence, Faculty of Computer Science, Group of Artificial Intelligence Applications, Universidad Complutense de Madrid, Ciudad Universitaria, Madrid, SpainSchool of Engineering, Universidad Internacional de La Rioja (UNIR), Logroño, SpainOffice of Information and Communications Technology, Ministry of Health and Social Protection, Bogotá, ColombiaThis paper presents a neural network-based classifier to predict whether a person is at risk of developing chronic kidney disease (CKD). The model is trained with the demographic data and medical care information of two population groups: on the one hand, people diagnosed with CKD in Colombia during 2018, and on the other, a sample of people without a diagnosis of this disease. Once the model is trained and evaluation metrics for classification algorithms are applied, the model achieves 95% accuracy in the test data set, making its application for disease prognosis feasible. However, despite the demonstrated efficiency of the neural networks to predict CKD, this machine-learning paradigm is opaque to the expert regarding the explanation of the outcome. Current research on eXplainable AI proposes the use of twin systems, where a black-box machine-learning method is complemented by another white-box method that provides explanations about the predicted values. Case-Based Reasoning (CBR) has proved to be an ideal complement as this paradigm is able to find explanatory cases for an explanation-by-example justification of a neural network's prediction. In this paper, we apply and validate a NN-CBR twin system for the explanation of CKD predictions. As a result of this research, 3,494,516 people were identified as being at risk of developing CKD in Colombia, or 7% of the total population.https://ieeexplore.ieee.org/document/8877828/Chronic kidney disease predictionneural networkscase-based reasoningtwin systemsexplainable AIsupport vector machines
collection DOAJ
language English
format Article
sources DOAJ
author Gabriel R. Vasquez-Morales
Sergio M. Martinez-Monterrubio
Pablo Moreno-Ger
Juan A. Recio-Garcia
spellingShingle Gabriel R. Vasquez-Morales
Sergio M. Martinez-Monterrubio
Pablo Moreno-Ger
Juan A. Recio-Garcia
Explainable Prediction of Chronic Renal Disease in the Colombian Population Using Neural Networks and Case-Based Reasoning
IEEE Access
Chronic kidney disease prediction
neural networks
case-based reasoning
twin systems
explainable AI
support vector machines
author_facet Gabriel R. Vasquez-Morales
Sergio M. Martinez-Monterrubio
Pablo Moreno-Ger
Juan A. Recio-Garcia
author_sort Gabriel R. Vasquez-Morales
title Explainable Prediction of Chronic Renal Disease in the Colombian Population Using Neural Networks and Case-Based Reasoning
title_short Explainable Prediction of Chronic Renal Disease in the Colombian Population Using Neural Networks and Case-Based Reasoning
title_full Explainable Prediction of Chronic Renal Disease in the Colombian Population Using Neural Networks and Case-Based Reasoning
title_fullStr Explainable Prediction of Chronic Renal Disease in the Colombian Population Using Neural Networks and Case-Based Reasoning
title_full_unstemmed Explainable Prediction of Chronic Renal Disease in the Colombian Population Using Neural Networks and Case-Based Reasoning
title_sort explainable prediction of chronic renal disease in the colombian population using neural networks and case-based reasoning
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description This paper presents a neural network-based classifier to predict whether a person is at risk of developing chronic kidney disease (CKD). The model is trained with the demographic data and medical care information of two population groups: on the one hand, people diagnosed with CKD in Colombia during 2018, and on the other, a sample of people without a diagnosis of this disease. Once the model is trained and evaluation metrics for classification algorithms are applied, the model achieves 95% accuracy in the test data set, making its application for disease prognosis feasible. However, despite the demonstrated efficiency of the neural networks to predict CKD, this machine-learning paradigm is opaque to the expert regarding the explanation of the outcome. Current research on eXplainable AI proposes the use of twin systems, where a black-box machine-learning method is complemented by another white-box method that provides explanations about the predicted values. Case-Based Reasoning (CBR) has proved to be an ideal complement as this paradigm is able to find explanatory cases for an explanation-by-example justification of a neural network's prediction. In this paper, we apply and validate a NN-CBR twin system for the explanation of CKD predictions. As a result of this research, 3,494,516 people were identified as being at risk of developing CKD in Colombia, or 7% of the total population.
topic Chronic kidney disease prediction
neural networks
case-based reasoning
twin systems
explainable AI
support vector machines
url https://ieeexplore.ieee.org/document/8877828/
work_keys_str_mv AT gabrielrvasquezmorales explainablepredictionofchronicrenaldiseaseinthecolombianpopulationusingneuralnetworksandcasebasedreasoning
AT sergiommartinezmonterrubio explainablepredictionofchronicrenaldiseaseinthecolombianpopulationusingneuralnetworksandcasebasedreasoning
AT pablomorenoger explainablepredictionofchronicrenaldiseaseinthecolombianpopulationusingneuralnetworksandcasebasedreasoning
AT juanareciogarcia explainablepredictionofchronicrenaldiseaseinthecolombianpopulationusingneuralnetworksandcasebasedreasoning
_version_ 1724190158686781440