Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records
Summary: Background: Many mortality prediction models have been developed for patients in intensive care units (ICUs); most are based on data available at ICU admission. We investigated whether machine learning methods using analyses of time-series data improved mortality prognostication for patien...
Main Authors: | , , , , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2020-04-01
|
Series: | The Lancet: Digital Health |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2589750020300182 |
id |
doaj-3ba93ebadb0841d38f9b17d3de6b2490 |
---|---|
record_format |
Article |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Hans-Christian Thorsen-Meyer, MD Annelaura B Nielsen, PhD Anna P Nielsen, MD Benjamin Skov Kaas-Hansen, MD Palle Toft, ProfDMSc Jens Schierbeck, MD Thomas Strøm, PhD Piotr J Chmura, MSc Marc Heimann, BSc Lars Dybdahl, MSc Lasse Spangsege, MSc Patrick Hulsen, BSc Kirstine Belling, PhD Søren Brunak, ProfPhD Anders Perner, ProfPhD |
spellingShingle |
Hans-Christian Thorsen-Meyer, MD Annelaura B Nielsen, PhD Anna P Nielsen, MD Benjamin Skov Kaas-Hansen, MD Palle Toft, ProfDMSc Jens Schierbeck, MD Thomas Strøm, PhD Piotr J Chmura, MSc Marc Heimann, BSc Lars Dybdahl, MSc Lasse Spangsege, MSc Patrick Hulsen, BSc Kirstine Belling, PhD Søren Brunak, ProfPhD Anders Perner, ProfPhD Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records The Lancet: Digital Health |
author_facet |
Hans-Christian Thorsen-Meyer, MD Annelaura B Nielsen, PhD Anna P Nielsen, MD Benjamin Skov Kaas-Hansen, MD Palle Toft, ProfDMSc Jens Schierbeck, MD Thomas Strøm, PhD Piotr J Chmura, MSc Marc Heimann, BSc Lars Dybdahl, MSc Lasse Spangsege, MSc Patrick Hulsen, BSc Kirstine Belling, PhD Søren Brunak, ProfPhD Anders Perner, ProfPhD |
author_sort |
Hans-Christian Thorsen-Meyer, MD |
title |
Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records |
title_short |
Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records |
title_full |
Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records |
title_fullStr |
Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records |
title_full_unstemmed |
Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records |
title_sort |
dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records |
publisher |
Elsevier |
series |
The Lancet: Digital Health |
issn |
2589-7500 |
publishDate |
2020-04-01 |
description |
Summary: Background: Many mortality prediction models have been developed for patients in intensive care units (ICUs); most are based on data available at ICU admission. We investigated whether machine learning methods using analyses of time-series data improved mortality prognostication for patients in the ICU by providing real-time predictions of 90-day mortality. In addition, we examined to what extent such a dynamic model could be made interpretable by quantifying and visualising the features that drive the predictions at different timepoints. Methods: Based on the Simplified Acute Physiology Score (SAPS) III variables, we trained a machine learning model on longitudinal data from patients admitted to four ICUs in the Capital Region, Denmark, between 2011 and 2016. We included all patients older than 16 years of age, with an ICU stay lasting more than 1 h, and who had a Danish civil registration number to enable 90-day follow-up. We leveraged static data and physiological time-series data from electronic health records and the Danish National Patient Registry. A recurrent neural network was trained with a temporal resolution of 1 h. The model was internally validated using the holdout method with 20% of the training dataset and externally validated using previously unseen data from a fifth hospital in Denmark. Its performance was assessed with the Matthews correlation coefficient (MCC) and area under the receiver operating characteristic curve (AUROC) as metrics, using bootstrapping with 1000 samples with replacement to construct 95% CIs. A Shapley additive explanations algorithm was applied to the prediction model to obtain explanations of the features that drive patient-specific predictions, and the contributions of each of the 44 features in the model were analysed and compared with the variables in the original SAPS III model. Findings: From a dataset containing 15 615 ICU admissions of 12 616 patients, we included 14 190 admissions of 11 492 patients in our analysis. Overall, 90-day mortality was 33·1% (3802 patients). The deep learning model showed a predictive performance on the holdout testing dataset that improved over the timecourse of an ICU stay: MCC 0·29 (95% CI 0·25–0·33) and AUROC 0·73 (0·71–0·74) at admission, 0·43 (0·40–0·47) and 0·82 (0·80–0·84) after 24 h, 0·50 (0·46–0·53) and 0·85 (0·84–0·87) after 72 h, and 0·57 (0·54–0·60) and 0·88 (0·87–0·89) at the time of discharge. The model exhibited good calibration properties. These results were validated in an external validation cohort of 5827 patients with 6748 admissions: MCC 0·29 (95% CI 0·27–0·32) and AUROC 0·75 (0·73–0·76) at admission, 0·41 (0·39–0·44) and 0·80 (0·79–0·81) after 24 h, 0·46 (0·43–0·48) and 0·82 (0·81–0·83) after 72 h, and 0·47 (0·44–0·49) and 0·83 (0·82–0·84) at the time of discharge. Interpretation: The prediction of 90-day mortality improved with 1-h sampling intervals during the ICU stay. The dynamic risk prediction can also be explained for an individual patient, visualising the features contributing to the prediction at any point in time. This explanation allows the clinician to determine whether there are elements in the current patient state and care that are potentially actionable, thus making the model suitable for further validation as a clinical tool. Funding: Novo Nordisk Foundation and the Innovation Fund Denmark. |
url |
http://www.sciencedirect.com/science/article/pii/S2589750020300182 |
work_keys_str_mv |
AT hanschristianthorsenmeyermd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords AT annelaurabnielsenphd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords AT annapnielsenmd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords AT benjaminskovkaashansenmd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords AT palletoftprofdmsc dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords AT jensschierbeckmd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords AT thomasstrømphd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords AT piotrjchmuramsc dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords AT marcheimannbsc dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords AT larsdybdahlmsc dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords AT lassespangsegemsc dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords AT patrickhulsenbsc dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords AT kirstinebellingphd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords AT sørenbrunakprofphd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords AT anderspernerprofphd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords |
_version_ |
1724923387787083776 |
spelling |
doaj-3ba93ebadb0841d38f9b17d3de6b24902020-11-25T02:09:30ZengElsevierThe Lancet: Digital Health2589-75002020-04-0124e179e191Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient recordsHans-Christian Thorsen-Meyer, MD0Annelaura B Nielsen, PhD1Anna P Nielsen, MD2Benjamin Skov Kaas-Hansen, MD3Palle Toft, ProfDMSc4Jens Schierbeck, MD5Thomas Strøm, PhD6Piotr J Chmura, MSc7Marc Heimann, BSc8Lars Dybdahl, MSc9Lasse Spangsege, MSc10Patrick Hulsen, BSc11Kirstine Belling, PhD12Søren Brunak, ProfPhD13Anders Perner, ProfPhD14Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark; Department of Intensive Care, Rigshospitalet, Copenhagen University Hospital, Copenhagen, DenmarkNovo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, DenmarkNovo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, DenmarkNovo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark; Clinical Pharmacology Unit, Zealand University Hospital, Roskilde, DenmarkDepartment of Anesthesiology and Intensive Care, Odense University Hospital, Odense, Denmark; Department of Clinical Research, University of Southern Denmark, Odense, DenmarkDepartment of Anesthesiology and Intensive Care, Odense University Hospital, Odense, Denmark; Department of Clinical Research, University of Southern Denmark, Odense, DenmarkDepartment of Anesthesiology and Intensive Care, Odense University Hospital, Odense, Denmark; Department of Clinical Research, University of Southern Denmark, Odense, DenmarkNovo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, DenmarkCentre for IT, Medical Technology and Telephony Services, Capital Region of Denmark, Copenhagen, DenmarkDaintel, Lyngby, DenmarkDaintel, Lyngby, DenmarkDaintel, Lyngby, DenmarkNovo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, DenmarkNovo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark; Correspondence to: Prof Søren Brunak, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2200 Copenhagen, DenmarkDepartment of Intensive Care, Rigshospitalet, Copenhagen University Hospital, Copenhagen, DenmarkSummary: Background: Many mortality prediction models have been developed for patients in intensive care units (ICUs); most are based on data available at ICU admission. We investigated whether machine learning methods using analyses of time-series data improved mortality prognostication for patients in the ICU by providing real-time predictions of 90-day mortality. In addition, we examined to what extent such a dynamic model could be made interpretable by quantifying and visualising the features that drive the predictions at different timepoints. Methods: Based on the Simplified Acute Physiology Score (SAPS) III variables, we trained a machine learning model on longitudinal data from patients admitted to four ICUs in the Capital Region, Denmark, between 2011 and 2016. We included all patients older than 16 years of age, with an ICU stay lasting more than 1 h, and who had a Danish civil registration number to enable 90-day follow-up. We leveraged static data and physiological time-series data from electronic health records and the Danish National Patient Registry. A recurrent neural network was trained with a temporal resolution of 1 h. The model was internally validated using the holdout method with 20% of the training dataset and externally validated using previously unseen data from a fifth hospital in Denmark. Its performance was assessed with the Matthews correlation coefficient (MCC) and area under the receiver operating characteristic curve (AUROC) as metrics, using bootstrapping with 1000 samples with replacement to construct 95% CIs. A Shapley additive explanations algorithm was applied to the prediction model to obtain explanations of the features that drive patient-specific predictions, and the contributions of each of the 44 features in the model were analysed and compared with the variables in the original SAPS III model. Findings: From a dataset containing 15 615 ICU admissions of 12 616 patients, we included 14 190 admissions of 11 492 patients in our analysis. Overall, 90-day mortality was 33·1% (3802 patients). The deep learning model showed a predictive performance on the holdout testing dataset that improved over the timecourse of an ICU stay: MCC 0·29 (95% CI 0·25–0·33) and AUROC 0·73 (0·71–0·74) at admission, 0·43 (0·40–0·47) and 0·82 (0·80–0·84) after 24 h, 0·50 (0·46–0·53) and 0·85 (0·84–0·87) after 72 h, and 0·57 (0·54–0·60) and 0·88 (0·87–0·89) at the time of discharge. The model exhibited good calibration properties. These results were validated in an external validation cohort of 5827 patients with 6748 admissions: MCC 0·29 (95% CI 0·27–0·32) and AUROC 0·75 (0·73–0·76) at admission, 0·41 (0·39–0·44) and 0·80 (0·79–0·81) after 24 h, 0·46 (0·43–0·48) and 0·82 (0·81–0·83) after 72 h, and 0·47 (0·44–0·49) and 0·83 (0·82–0·84) at the time of discharge. Interpretation: The prediction of 90-day mortality improved with 1-h sampling intervals during the ICU stay. The dynamic risk prediction can also be explained for an individual patient, visualising the features contributing to the prediction at any point in time. This explanation allows the clinician to determine whether there are elements in the current patient state and care that are potentially actionable, thus making the model suitable for further validation as a clinical tool. Funding: Novo Nordisk Foundation and the Innovation Fund Denmark.http://www.sciencedirect.com/science/article/pii/S2589750020300182 |