Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values
The early and accurate detection of the onset of acute myocardial infarction (AMI) is imperative for the timely provision of medical intervention and the reduction of its mortality rate. Machine learning techniques have demonstrated great potential in aiding disease diagnosis. In this paper, we pres...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9268965/ |
id |
doaj-481467d3a010464eb2d5aee2e0a75e0d |
---|---|
record_format |
Article |
spelling |
doaj-481467d3a010464eb2d5aee2e0a75e0d2021-03-30T04:55:27ZengIEEEIEEE Access2169-35362020-01-01821041021041710.1109/ACCESS.2020.30401669268965Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley ValuesLujain Ibrahim0https://orcid.org/0000-0002-0395-784XMunib Mesinovic1https://orcid.org/0000-0003-3757-7877Kai-Wen Yang2https://orcid.org/0000-0002-3915-4280Mohamad A. Eid3Division of Engineering, New York University Abu Dhabi, Abu Dhabi, United Arab EmiratesDivision of Engineering, New York University Abu Dhabi, Abu Dhabi, United Arab EmiratesDivision of Engineering, New York University Abu Dhabi, Abu Dhabi, United Arab EmiratesDivision of Engineering, New York University Abu Dhabi, Abu Dhabi, United Arab EmiratesThe early and accurate detection of the onset of acute myocardial infarction (AMI) is imperative for the timely provision of medical intervention and the reduction of its mortality rate. Machine learning techniques have demonstrated great potential in aiding disease diagnosis. In this paper, we present a framework to predict the onset of AMI using 713,447 extracted ECG samples and associated auxiliary data from the longitudinal and comprehensive ECG-ViEW II database, previously unexplored in the field of machine learning in healthcare. The framework is realized with two deep learning models, a convolutional neural network (CNN) and a recurrent neural network (RNN), and a decision-tree based model, XGBoost. Synthetic minority oversampling technique (SMOTE) was utilized to address class imbalance. High prediction accuracy of 89.9%, 84.6%, 97.5% and ROC curve areas of 90.7%, 82.9%, 96.5% have been achieved for the best CNN, RNN, and XGBoost models, respectively. Shapley values were utilized to identify the features that contributed most to the classification decision with XGBoost, demonstrating the high impact of auxiliary inputs such as age and sex. This paper demonstrates the promising application of explainable machine learning in the field of cardiovascular disease prediction.https://ieeexplore.ieee.org/document/9268965/Machine learningbiomedical informaticspredictive modelsacute myocardial infarction |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Lujain Ibrahim Munib Mesinovic Kai-Wen Yang Mohamad A. Eid |
spellingShingle |
Lujain Ibrahim Munib Mesinovic Kai-Wen Yang Mohamad A. Eid Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values IEEE Access Machine learning biomedical informatics predictive models acute myocardial infarction |
author_facet |
Lujain Ibrahim Munib Mesinovic Kai-Wen Yang Mohamad A. Eid |
author_sort |
Lujain Ibrahim |
title |
Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values |
title_short |
Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values |
title_full |
Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values |
title_fullStr |
Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values |
title_full_unstemmed |
Explainable Prediction of Acute Myocardial Infarction Using Machine Learning and Shapley Values |
title_sort |
explainable prediction of acute myocardial infarction using machine learning and shapley values |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
The early and accurate detection of the onset of acute myocardial infarction (AMI) is imperative for the timely provision of medical intervention and the reduction of its mortality rate. Machine learning techniques have demonstrated great potential in aiding disease diagnosis. In this paper, we present a framework to predict the onset of AMI using 713,447 extracted ECG samples and associated auxiliary data from the longitudinal and comprehensive ECG-ViEW II database, previously unexplored in the field of machine learning in healthcare. The framework is realized with two deep learning models, a convolutional neural network (CNN) and a recurrent neural network (RNN), and a decision-tree based model, XGBoost. Synthetic minority oversampling technique (SMOTE) was utilized to address class imbalance. High prediction accuracy of 89.9%, 84.6%, 97.5% and ROC curve areas of 90.7%, 82.9%, 96.5% have been achieved for the best CNN, RNN, and XGBoost models, respectively. Shapley values were utilized to identify the features that contributed most to the classification decision with XGBoost, demonstrating the high impact of auxiliary inputs such as age and sex. This paper demonstrates the promising application of explainable machine learning in the field of cardiovascular disease prediction. |
topic |
Machine learning biomedical informatics predictive models acute myocardial infarction |
url |
https://ieeexplore.ieee.org/document/9268965/ |
work_keys_str_mv |
AT lujainibrahim explainablepredictionofacutemyocardialinfarctionusingmachinelearningandshapleyvalues AT munibmesinovic explainablepredictionofacutemyocardialinfarctionusingmachinelearningandshapleyvalues AT kaiwenyang explainablepredictionofacutemyocardialinfarctionusingmachinelearningandshapleyvalues AT mohamadaeid explainablepredictionofacutemyocardialinfarctionusingmachinelearningandshapleyvalues |
_version_ |
1724180996674289664 |