Predicting mortality risk for preterm infants using random forest
Abstract Mortality is an unfortunately common outcome of extremely and very preterm birth. Existing clinical prediction models capture mortality risk at birth but fail to account for the remainder of the hospital course. Infants born < 32 weeks gestation with complete physiologic and clinical dat...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Publishing Group
2021-03-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-021-86748-4 |
id |
doaj-5f2c133598bb4e5e884623873be2e4dc |
---|---|
record_format |
Article |
spelling |
doaj-5f2c133598bb4e5e884623873be2e4dc2021-04-04T11:32:03ZengNature Publishing GroupScientific Reports2045-23222021-03-011111910.1038/s41598-021-86748-4Predicting mortality risk for preterm infants using random forestJennifer Lee0Jinjin Cai1Fuhai Li2Zachary A. Vesoulis3Washington University School of MedicineDivision of Biostatistics, Washington University School of MedicineInstitute for Informatics, Washington UniversityDepartment of Pediatrics, Division of Newborn Medicine, Washington University School of MedicineAbstract Mortality is an unfortunately common outcome of extremely and very preterm birth. Existing clinical prediction models capture mortality risk at birth but fail to account for the remainder of the hospital course. Infants born < 32 weeks gestation with complete physiologic and clinical data were included in this retrospective study. Mortality risk was quantified by conventional means (clinical factors) using the CRIB-II score and the optimal logistic regression model. A random forest (RF) model was trained using a subset of the cohort, labeling data within 6 h of death as “worry.” The model was then tested using the remaining infants. A total of 275 infants were included in the study with a mean gestational age of 27 weeks, mean birth weight of 929 g, 49% female, and a mortality rate of 21%. The CRIB-II and logistic regression models had acceptable performance with sensitivities of 71% and 80% AUC scores of 0.78 and 0.84, respectively. The RF model had superior performance with a sensitivity of 88% and an AUC of 0.93. A random forest model which incorporates fixed clinical factors with the influence of aberrancies in subsequent physiology has superior performance for mortality prediction compared to conventional models.https://doi.org/10.1038/s41598-021-86748-4 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jennifer Lee Jinjin Cai Fuhai Li Zachary A. Vesoulis |
spellingShingle |
Jennifer Lee Jinjin Cai Fuhai Li Zachary A. Vesoulis Predicting mortality risk for preterm infants using random forest Scientific Reports |
author_facet |
Jennifer Lee Jinjin Cai Fuhai Li Zachary A. Vesoulis |
author_sort |
Jennifer Lee |
title |
Predicting mortality risk for preterm infants using random forest |
title_short |
Predicting mortality risk for preterm infants using random forest |
title_full |
Predicting mortality risk for preterm infants using random forest |
title_fullStr |
Predicting mortality risk for preterm infants using random forest |
title_full_unstemmed |
Predicting mortality risk for preterm infants using random forest |
title_sort |
predicting mortality risk for preterm infants using random forest |
publisher |
Nature Publishing Group |
series |
Scientific Reports |
issn |
2045-2322 |
publishDate |
2021-03-01 |
description |
Abstract Mortality is an unfortunately common outcome of extremely and very preterm birth. Existing clinical prediction models capture mortality risk at birth but fail to account for the remainder of the hospital course. Infants born < 32 weeks gestation with complete physiologic and clinical data were included in this retrospective study. Mortality risk was quantified by conventional means (clinical factors) using the CRIB-II score and the optimal logistic regression model. A random forest (RF) model was trained using a subset of the cohort, labeling data within 6 h of death as “worry.” The model was then tested using the remaining infants. A total of 275 infants were included in the study with a mean gestational age of 27 weeks, mean birth weight of 929 g, 49% female, and a mortality rate of 21%. The CRIB-II and logistic regression models had acceptable performance with sensitivities of 71% and 80% AUC scores of 0.78 and 0.84, respectively. The RF model had superior performance with a sensitivity of 88% and an AUC of 0.93. A random forest model which incorporates fixed clinical factors with the influence of aberrancies in subsequent physiology has superior performance for mortality prediction compared to conventional models. |
url |
https://doi.org/10.1038/s41598-021-86748-4 |
work_keys_str_mv |
AT jenniferlee predictingmortalityriskforpreterminfantsusingrandomforest AT jinjincai predictingmortalityriskforpreterminfantsusingrandomforest AT fuhaili predictingmortalityriskforpreterminfantsusingrandomforest AT zacharyavesoulis predictingmortalityriskforpreterminfantsusingrandomforest |
_version_ |
1721542646101442560 |