Predicting mortality risk for preterm infants using random forest

Abstract Mortality is an unfortunately common outcome of extremely and very preterm birth. Existing clinical prediction models capture mortality risk at birth but fail to account for the remainder of the hospital course. Infants born < 32 weeks gestation with complete physiologic and clinical dat...

Full description

Bibliographic Details
Main Authors:	Jennifer Lee, Jinjin Cai, Fuhai Li, Zachary A. Vesoulis
Format:	Article
Language:	English
Published:	Nature Publishing Group 2021-03-01
Series:	Scientific Reports
Online Access:	https://doi.org/10.1038/s41598-021-86748-4

id	doaj-5f2c133598bb4e5e884623873be2e4dc
record_format	Article
spelling	doaj-5f2c133598bb4e5e884623873be2e4dc2021-04-04T11:32:03ZengNature Publishing GroupScientific Reports2045-23222021-03-011111910.1038/s41598-021-86748-4Predicting mortality risk for preterm infants using random forestJennifer Lee0Jinjin Cai1Fuhai Li2Zachary A. Vesoulis3Washington University School of MedicineDivision of Biostatistics, Washington University School of MedicineInstitute for Informatics, Washington UniversityDepartment of Pediatrics, Division of Newborn Medicine, Washington University School of MedicineAbstract Mortality is an unfortunately common outcome of extremely and very preterm birth. Existing clinical prediction models capture mortality risk at birth but fail to account for the remainder of the hospital course. Infants born < 32 weeks gestation with complete physiologic and clinical data were included in this retrospective study. Mortality risk was quantified by conventional means (clinical factors) using the CRIB-II score and the optimal logistic regression model. A random forest (RF) model was trained using a subset of the cohort, labeling data within 6 h of death as “worry.” The model was then tested using the remaining infants. A total of 275 infants were included in the study with a mean gestational age of 27 weeks, mean birth weight of 929 g, 49% female, and a mortality rate of 21%. The CRIB-II and logistic regression models had acceptable performance with sensitivities of 71% and 80% AUC scores of 0.78 and 0.84, respectively. The RF model had superior performance with a sensitivity of 88% and an AUC of 0.93. A random forest model which incorporates fixed clinical factors with the influence of aberrancies in subsequent physiology has superior performance for mortality prediction compared to conventional models.https://doi.org/10.1038/s41598-021-86748-4
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Jennifer Lee Jinjin Cai Fuhai Li Zachary A. Vesoulis
spellingShingle	Jennifer Lee Jinjin Cai Fuhai Li Zachary A. Vesoulis Predicting mortality risk for preterm infants using random forest Scientific Reports
author_facet	Jennifer Lee Jinjin Cai Fuhai Li Zachary A. Vesoulis
author_sort	Jennifer Lee
title	Predicting mortality risk for preterm infants using random forest
title_short	Predicting mortality risk for preterm infants using random forest
title_full	Predicting mortality risk for preterm infants using random forest
title_fullStr	Predicting mortality risk for preterm infants using random forest
title_full_unstemmed	Predicting mortality risk for preterm infants using random forest
title_sort	predicting mortality risk for preterm infants using random forest
publisher	Nature Publishing Group
series	Scientific Reports
issn	2045-2322
publishDate	2021-03-01
description	Abstract Mortality is an unfortunately common outcome of extremely and very preterm birth. Existing clinical prediction models capture mortality risk at birth but fail to account for the remainder of the hospital course. Infants born < 32 weeks gestation with complete physiologic and clinical data were included in this retrospective study. Mortality risk was quantified by conventional means (clinical factors) using the CRIB-II score and the optimal logistic regression model. A random forest (RF) model was trained using a subset of the cohort, labeling data within 6 h of death as “worry.” The model was then tested using the remaining infants. A total of 275 infants were included in the study with a mean gestational age of 27 weeks, mean birth weight of 929 g, 49% female, and a mortality rate of 21%. The CRIB-II and logistic regression models had acceptable performance with sensitivities of 71% and 80% AUC scores of 0.78 and 0.84, respectively. The RF model had superior performance with a sensitivity of 88% and an AUC of 0.93. A random forest model which incorporates fixed clinical factors with the influence of aberrancies in subsequent physiology has superior performance for mortality prediction compared to conventional models.
url	https://doi.org/10.1038/s41598-021-86748-4
work_keys_str_mv	AT jenniferlee predictingmortalityriskforpreterminfantsusingrandomforest AT jinjincai predictingmortalityriskforpreterminfantsusingrandomforest AT fuhaili predictingmortalityriskforpreterminfantsusingrandomforest AT zacharyavesoulis predictingmortalityriskforpreterminfantsusingrandomforest
_version_	1721542646101442560

Predicting mortality risk for preterm infants using random forest

Similar Items