Predicting mortality risk for preterm infants using random forest

Abstract Mortality is an unfortunately common outcome of extremely and very preterm birth. Existing clinical prediction models capture mortality risk at birth but fail to account for the remainder of the hospital course. Infants born < 32 weeks gestation with complete physiologic and clinical dat...

Full description

Bibliographic Details
Main Authors: Jennifer Lee, Jinjin Cai, Fuhai Li, Zachary A. Vesoulis
Format: Article
Language:English
Published: Nature Publishing Group 2021-03-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-86748-4
id doaj-5f2c133598bb4e5e884623873be2e4dc
record_format Article
spelling doaj-5f2c133598bb4e5e884623873be2e4dc2021-04-04T11:32:03ZengNature Publishing GroupScientific Reports2045-23222021-03-011111910.1038/s41598-021-86748-4Predicting mortality risk for preterm infants using random forestJennifer Lee0Jinjin Cai1Fuhai Li2Zachary A. Vesoulis3Washington University School of MedicineDivision of Biostatistics, Washington University School of MedicineInstitute for Informatics, Washington UniversityDepartment of Pediatrics, Division of Newborn Medicine, Washington University School of MedicineAbstract Mortality is an unfortunately common outcome of extremely and very preterm birth. Existing clinical prediction models capture mortality risk at birth but fail to account for the remainder of the hospital course. Infants born < 32 weeks gestation with complete physiologic and clinical data were included in this retrospective study. Mortality risk was quantified by conventional means (clinical factors) using the CRIB-II score and the optimal logistic regression model. A random forest (RF) model was trained using a subset of the cohort, labeling data within 6 h of death as “worry.” The model was then tested using the remaining infants. A total of 275 infants were included in the study with a mean gestational age of 27 weeks, mean birth weight of 929 g, 49% female, and a mortality rate of 21%. The CRIB-II and logistic regression models had acceptable performance with sensitivities of 71% and 80% AUC scores of 0.78 and 0.84, respectively. The RF model had superior performance with a sensitivity of 88% and an AUC of 0.93. A random forest model which incorporates fixed clinical factors with the influence of aberrancies in subsequent physiology has superior performance for mortality prediction compared to conventional models.https://doi.org/10.1038/s41598-021-86748-4
collection DOAJ
language English
format Article
sources DOAJ
author Jennifer Lee
Jinjin Cai
Fuhai Li
Zachary A. Vesoulis
spellingShingle Jennifer Lee
Jinjin Cai
Fuhai Li
Zachary A. Vesoulis
Predicting mortality risk for preterm infants using random forest
Scientific Reports
author_facet Jennifer Lee
Jinjin Cai
Fuhai Li
Zachary A. Vesoulis
author_sort Jennifer Lee
title Predicting mortality risk for preterm infants using random forest
title_short Predicting mortality risk for preterm infants using random forest
title_full Predicting mortality risk for preterm infants using random forest
title_fullStr Predicting mortality risk for preterm infants using random forest
title_full_unstemmed Predicting mortality risk for preterm infants using random forest
title_sort predicting mortality risk for preterm infants using random forest
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2021-03-01
description Abstract Mortality is an unfortunately common outcome of extremely and very preterm birth. Existing clinical prediction models capture mortality risk at birth but fail to account for the remainder of the hospital course. Infants born < 32 weeks gestation with complete physiologic and clinical data were included in this retrospective study. Mortality risk was quantified by conventional means (clinical factors) using the CRIB-II score and the optimal logistic regression model. A random forest (RF) model was trained using a subset of the cohort, labeling data within 6 h of death as “worry.” The model was then tested using the remaining infants. A total of 275 infants were included in the study with a mean gestational age of 27 weeks, mean birth weight of 929 g, 49% female, and a mortality rate of 21%. The CRIB-II and logistic regression models had acceptable performance with sensitivities of 71% and 80% AUC scores of 0.78 and 0.84, respectively. The RF model had superior performance with a sensitivity of 88% and an AUC of 0.93. A random forest model which incorporates fixed clinical factors with the influence of aberrancies in subsequent physiology has superior performance for mortality prediction compared to conventional models.
url https://doi.org/10.1038/s41598-021-86748-4
work_keys_str_mv AT jenniferlee predictingmortalityriskforpreterminfantsusingrandomforest
AT jinjincai predictingmortalityriskforpreterminfantsusingrandomforest
AT fuhaili predictingmortalityriskforpreterminfantsusingrandomforest
AT zacharyavesoulis predictingmortalityriskforpreterminfantsusingrandomforest
_version_ 1721542646101442560