Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records

Summary: Background: Many mortality prediction models have been developed for patients in intensive care units (ICUs); most are based on data available at ICU admission. We investigated whether machine learning methods using analyses of time-series data improved mortality prognostication for patien...

Full description

Bibliographic Details
Main Authors: Hans-Christian Thorsen-Meyer, MD, Annelaura B Nielsen, PhD, Anna P Nielsen, MD, Benjamin Skov Kaas-Hansen, MD, Palle Toft, ProfDMSc, Jens Schierbeck, MD, Thomas Strøm, PhD, Piotr J Chmura, MSc, Marc Heimann, BSc, Lars Dybdahl, MSc, Lasse Spangsege, MSc, Patrick Hulsen, BSc, Kirstine Belling, PhD, Søren Brunak, ProfPhD, Anders Perner, ProfPhD
Format: Article
Language:English
Published: Elsevier 2020-04-01
Series:The Lancet: Digital Health
Online Access:http://www.sciencedirect.com/science/article/pii/S2589750020300182
id doaj-3ba93ebadb0841d38f9b17d3de6b2490
record_format Article
collection DOAJ
language English
format Article
sources DOAJ
author Hans-Christian Thorsen-Meyer, MD
Annelaura B Nielsen, PhD
Anna P Nielsen, MD
Benjamin Skov Kaas-Hansen, MD
Palle Toft, ProfDMSc
Jens Schierbeck, MD
Thomas Strøm, PhD
Piotr J Chmura, MSc
Marc Heimann, BSc
Lars Dybdahl, MSc
Lasse Spangsege, MSc
Patrick Hulsen, BSc
Kirstine Belling, PhD
Søren Brunak, ProfPhD
Anders Perner, ProfPhD
spellingShingle Hans-Christian Thorsen-Meyer, MD
Annelaura B Nielsen, PhD
Anna P Nielsen, MD
Benjamin Skov Kaas-Hansen, MD
Palle Toft, ProfDMSc
Jens Schierbeck, MD
Thomas Strøm, PhD
Piotr J Chmura, MSc
Marc Heimann, BSc
Lars Dybdahl, MSc
Lasse Spangsege, MSc
Patrick Hulsen, BSc
Kirstine Belling, PhD
Søren Brunak, ProfPhD
Anders Perner, ProfPhD
Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records
The Lancet: Digital Health
author_facet Hans-Christian Thorsen-Meyer, MD
Annelaura B Nielsen, PhD
Anna P Nielsen, MD
Benjamin Skov Kaas-Hansen, MD
Palle Toft, ProfDMSc
Jens Schierbeck, MD
Thomas Strøm, PhD
Piotr J Chmura, MSc
Marc Heimann, BSc
Lars Dybdahl, MSc
Lasse Spangsege, MSc
Patrick Hulsen, BSc
Kirstine Belling, PhD
Søren Brunak, ProfPhD
Anders Perner, ProfPhD
author_sort Hans-Christian Thorsen-Meyer, MD
title Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records
title_short Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records
title_full Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records
title_fullStr Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records
title_full_unstemmed Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records
title_sort dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient records
publisher Elsevier
series The Lancet: Digital Health
issn 2589-7500
publishDate 2020-04-01
description Summary: Background: Many mortality prediction models have been developed for patients in intensive care units (ICUs); most are based on data available at ICU admission. We investigated whether machine learning methods using analyses of time-series data improved mortality prognostication for patients in the ICU by providing real-time predictions of 90-day mortality. In addition, we examined to what extent such a dynamic model could be made interpretable by quantifying and visualising the features that drive the predictions at different timepoints. Methods: Based on the Simplified Acute Physiology Score (SAPS) III variables, we trained a machine learning model on longitudinal data from patients admitted to four ICUs in the Capital Region, Denmark, between 2011 and 2016. We included all patients older than 16 years of age, with an ICU stay lasting more than 1 h, and who had a Danish civil registration number to enable 90-day follow-up. We leveraged static data and physiological time-series data from electronic health records and the Danish National Patient Registry. A recurrent neural network was trained with a temporal resolution of 1 h. The model was internally validated using the holdout method with 20% of the training dataset and externally validated using previously unseen data from a fifth hospital in Denmark. Its performance was assessed with the Matthews correlation coefficient (MCC) and area under the receiver operating characteristic curve (AUROC) as metrics, using bootstrapping with 1000 samples with replacement to construct 95% CIs. A Shapley additive explanations algorithm was applied to the prediction model to obtain explanations of the features that drive patient-specific predictions, and the contributions of each of the 44 features in the model were analysed and compared with the variables in the original SAPS III model. Findings: From a dataset containing 15 615 ICU admissions of 12 616 patients, we included 14 190 admissions of 11 492 patients in our analysis. Overall, 90-day mortality was 33·1% (3802 patients). The deep learning model showed a predictive performance on the holdout testing dataset that improved over the timecourse of an ICU stay: MCC 0·29 (95% CI 0·25–0·33) and AUROC 0·73 (0·71–0·74) at admission, 0·43 (0·40–0·47) and 0·82 (0·80–0·84) after 24 h, 0·50 (0·46–0·53) and 0·85 (0·84–0·87) after 72 h, and 0·57 (0·54–0·60) and 0·88 (0·87–0·89) at the time of discharge. The model exhibited good calibration properties. These results were validated in an external validation cohort of 5827 patients with 6748 admissions: MCC 0·29 (95% CI 0·27–0·32) and AUROC 0·75 (0·73–0·76) at admission, 0·41 (0·39–0·44) and 0·80 (0·79–0·81) after 24 h, 0·46 (0·43–0·48) and 0·82 (0·81–0·83) after 72 h, and 0·47 (0·44–0·49) and 0·83 (0·82–0·84) at the time of discharge. Interpretation: The prediction of 90-day mortality improved with 1-h sampling intervals during the ICU stay. The dynamic risk prediction can also be explained for an individual patient, visualising the features contributing to the prediction at any point in time. This explanation allows the clinician to determine whether there are elements in the current patient state and care that are potentially actionable, thus making the model suitable for further validation as a clinical tool. Funding: Novo Nordisk Foundation and the Innovation Fund Denmark.
url http://www.sciencedirect.com/science/article/pii/S2589750020300182
work_keys_str_mv AT hanschristianthorsenmeyermd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
AT annelaurabnielsenphd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
AT annapnielsenmd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
AT benjaminskovkaashansenmd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
AT palletoftprofdmsc dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
AT jensschierbeckmd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
AT thomasstrømphd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
AT piotrjchmuramsc dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
AT marcheimannbsc dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
AT larsdybdahlmsc dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
AT lassespangsegemsc dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
AT patrickhulsenbsc dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
AT kirstinebellingphd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
AT sørenbrunakprofphd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
AT anderspernerprofphd dynamicandexplainablemachinelearningpredictionofmortalityinpatientsintheintensivecareunitaretrospectivestudyofhighfrequencydatainelectronicpatientrecords
_version_ 1724923387787083776
spelling doaj-3ba93ebadb0841d38f9b17d3de6b24902020-11-25T02:09:30ZengElsevierThe Lancet: Digital Health2589-75002020-04-0124e179e191Dynamic and explainable machine learning prediction of mortality in patients in the intensive care unit: a retrospective study of high-frequency data in electronic patient recordsHans-Christian Thorsen-Meyer, MD0Annelaura B Nielsen, PhD1Anna P Nielsen, MD2Benjamin Skov Kaas-Hansen, MD3Palle Toft, ProfDMSc4Jens Schierbeck, MD5Thomas Strøm, PhD6Piotr J Chmura, MSc7Marc Heimann, BSc8Lars Dybdahl, MSc9Lasse Spangsege, MSc10Patrick Hulsen, BSc11Kirstine Belling, PhD12Søren Brunak, ProfPhD13Anders Perner, ProfPhD14Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark; Department of Intensive Care, Rigshospitalet, Copenhagen University Hospital, Copenhagen, DenmarkNovo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, DenmarkNovo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, DenmarkNovo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark; Clinical Pharmacology Unit, Zealand University Hospital, Roskilde, DenmarkDepartment of Anesthesiology and Intensive Care, Odense University Hospital, Odense, Denmark; Department of Clinical Research, University of Southern Denmark, Odense, DenmarkDepartment of Anesthesiology and Intensive Care, Odense University Hospital, Odense, Denmark; Department of Clinical Research, University of Southern Denmark, Odense, DenmarkDepartment of Anesthesiology and Intensive Care, Odense University Hospital, Odense, Denmark; Department of Clinical Research, University of Southern Denmark, Odense, DenmarkNovo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, DenmarkCentre for IT, Medical Technology and Telephony Services, Capital Region of Denmark, Copenhagen, DenmarkDaintel, Lyngby, DenmarkDaintel, Lyngby, DenmarkDaintel, Lyngby, DenmarkNovo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, DenmarkNovo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark; Correspondence to: Prof Søren Brunak, Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2200 Copenhagen, DenmarkDepartment of Intensive Care, Rigshospitalet, Copenhagen University Hospital, Copenhagen, DenmarkSummary: Background: Many mortality prediction models have been developed for patients in intensive care units (ICUs); most are based on data available at ICU admission. We investigated whether machine learning methods using analyses of time-series data improved mortality prognostication for patients in the ICU by providing real-time predictions of 90-day mortality. In addition, we examined to what extent such a dynamic model could be made interpretable by quantifying and visualising the features that drive the predictions at different timepoints. Methods: Based on the Simplified Acute Physiology Score (SAPS) III variables, we trained a machine learning model on longitudinal data from patients admitted to four ICUs in the Capital Region, Denmark, between 2011 and 2016. We included all patients older than 16 years of age, with an ICU stay lasting more than 1 h, and who had a Danish civil registration number to enable 90-day follow-up. We leveraged static data and physiological time-series data from electronic health records and the Danish National Patient Registry. A recurrent neural network was trained with a temporal resolution of 1 h. The model was internally validated using the holdout method with 20% of the training dataset and externally validated using previously unseen data from a fifth hospital in Denmark. Its performance was assessed with the Matthews correlation coefficient (MCC) and area under the receiver operating characteristic curve (AUROC) as metrics, using bootstrapping with 1000 samples with replacement to construct 95% CIs. A Shapley additive explanations algorithm was applied to the prediction model to obtain explanations of the features that drive patient-specific predictions, and the contributions of each of the 44 features in the model were analysed and compared with the variables in the original SAPS III model. Findings: From a dataset containing 15 615 ICU admissions of 12 616 patients, we included 14 190 admissions of 11 492 patients in our analysis. Overall, 90-day mortality was 33·1% (3802 patients). The deep learning model showed a predictive performance on the holdout testing dataset that improved over the timecourse of an ICU stay: MCC 0·29 (95% CI 0·25–0·33) and AUROC 0·73 (0·71–0·74) at admission, 0·43 (0·40–0·47) and 0·82 (0·80–0·84) after 24 h, 0·50 (0·46–0·53) and 0·85 (0·84–0·87) after 72 h, and 0·57 (0·54–0·60) and 0·88 (0·87–0·89) at the time of discharge. The model exhibited good calibration properties. These results were validated in an external validation cohort of 5827 patients with 6748 admissions: MCC 0·29 (95% CI 0·27–0·32) and AUROC 0·75 (0·73–0·76) at admission, 0·41 (0·39–0·44) and 0·80 (0·79–0·81) after 24 h, 0·46 (0·43–0·48) and 0·82 (0·81–0·83) after 72 h, and 0·47 (0·44–0·49) and 0·83 (0·82–0·84) at the time of discharge. Interpretation: The prediction of 90-day mortality improved with 1-h sampling intervals during the ICU stay. The dynamic risk prediction can also be explained for an individual patient, visualising the features contributing to the prediction at any point in time. This explanation allows the clinician to determine whether there are elements in the current patient state and care that are potentially actionable, thus making the model suitable for further validation as a clinical tool. Funding: Novo Nordisk Foundation and the Innovation Fund Denmark.http://www.sciencedirect.com/science/article/pii/S2589750020300182