Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values

The early and accurate detection of the onset of acute myocardial infarction (AMI) is imperative for the timely provision of medical intervention and the reduction of its mortality rate. Machine learning techniques have demonstrated great potential in aiding disease diagnosis. In this paper, we pres...

Full description

Bibliographic Details
Main Authors: Lujain Ibrahim, Munib Mesinovic, Kai-Wen Yang, Mohamad A. Eid
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9268965/
id doaj-481467d3a010464eb2d5aee2e0a75e0d
record_format Article
spelling doaj-481467d3a010464eb2d5aee2e0a75e0d2021-03-30T04:55:27ZengIEEEIEEE Access2169-35362020-01-01821041021041710.1109/ACCESS.2020.30401669268965Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley ValuesLujain Ibrahim0https://orcid.org/0000-0002-0395-784XMunib Mesinovic1https://orcid.org/0000-0003-3757-7877Kai-Wen Yang2https://orcid.org/0000-0002-3915-4280Mohamad A. Eid3Division of Engineering, New York University Abu Dhabi, Abu Dhabi, United Arab EmiratesDivision of Engineering, New York University Abu Dhabi, Abu Dhabi, United Arab EmiratesDivision of Engineering, New York University Abu Dhabi, Abu Dhabi, United Arab EmiratesDivision of Engineering, New York University Abu Dhabi, Abu Dhabi, United Arab EmiratesThe early and accurate detection of the onset of acute myocardial infarction (AMI) is imperative for the timely provision of medical intervention and the reduction of its mortality rate. Machine learning techniques have demonstrated great potential in aiding disease diagnosis. In this paper, we present a framework to predict the onset of AMI using 713,447 extracted ECG samples and associated auxiliary data from the longitudinal and comprehensive ECG-ViEW II database, previously unexplored in the field of machine learning in healthcare. The framework is realized with two deep learning models, a convolutional neural network (CNN) and a recurrent neural network (RNN), and a decision-tree based model, XGBoost. Synthetic minority oversampling technique (SMOTE) was utilized to address class imbalance. High prediction accuracy of 89.9%, 84.6%, 97.5% and ROC curve areas of 90.7%, 82.9%, 96.5% have been achieved for the best CNN, RNN, and XGBoost models, respectively. Shapley values were utilized to identify the features that contributed most to the classification decision with XGBoost, demonstrating the high impact of auxiliary inputs such as age and sex. This paper demonstrates the promising application of explainable machine learning in the field of cardiovascular disease prediction.https://ieeexplore.ieee.org/document/9268965/Machine learningbiomedical informaticspredictive modelsacute myocardial infarction
collection DOAJ
language English
format Article
sources DOAJ
author Lujain Ibrahim
Munib Mesinovic
Kai-Wen Yang
Mohamad A. Eid
spellingShingle Lujain Ibrahim
Munib Mesinovic
Kai-Wen Yang
Mohamad A. Eid
Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values
IEEE Access
Machine learning
biomedical informatics
predictive models
acute myocardial infarction
author_facet Lujain Ibrahim
Munib Mesinovic
Kai-Wen Yang
Mohamad A. Eid
author_sort Lujain Ibrahim
title Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values
title_short Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values
title_full Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values
title_fullStr Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values
title_full_unstemmed Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values
title_sort explainable prediction of acute myocardial infarction using machine learning and shapley values
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description The early and accurate detection of the onset of acute myocardial infarction (AMI) is imperative for the timely provision of medical intervention and the reduction of its mortality rate. Machine learning techniques have demonstrated great potential in aiding disease diagnosis. In this paper, we present a framework to predict the onset of AMI using 713,447 extracted ECG samples and associated auxiliary data from the longitudinal and comprehensive ECG-ViEW II database, previously unexplored in the field of machine learning in healthcare. The framework is realized with two deep learning models, a convolutional neural network (CNN) and a recurrent neural network (RNN), and a decision-tree based model, XGBoost. Synthetic minority oversampling technique (SMOTE) was utilized to address class imbalance. High prediction accuracy of 89.9%, 84.6%, 97.5% and ROC curve areas of 90.7%, 82.9%, 96.5% have been achieved for the best CNN, RNN, and XGBoost models, respectively. Shapley values were utilized to identify the features that contributed most to the classification decision with XGBoost, demonstrating the high impact of auxiliary inputs such as age and sex. This paper demonstrates the promising application of explainable machine learning in the field of cardiovascular disease prediction.
topic Machine learning
biomedical informatics
predictive models
acute myocardial infarction
url https://ieeexplore.ieee.org/document/9268965/
work_keys_str_mv AT lujainibrahim explainablepredictionofacutemyocardialinfarctionusingmachinelearningandshapleyvalues
AT munibmesinovic explainablepredictionofacutemyocardialinfarctionusingmachinelearningandshapleyvalues
AT kaiwenyang explainablepredictionofacutemyocardialinfarctionusingmachinelearningandshapleyvalues
AT mohamadaeid explainablepredictionofacutemyocardialinfarctionusingmachinelearningandshapleyvalues
_version_ 1724180996674289664