Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis

BackgroundPredictions in pregnancy care are complex because of interactions among multiple factors. Hence, pregnancy outcomes are not easily predicted by a single predictor using only one algorithm or modeling method. ObjectiveThis study aims to review and compare...

Full description

Bibliographic Details
Main Authors: Sufriyana, Herdiantri, Husnayain, Atina, Chen, Ya-Lin, Kuo, Chao-Yang, Singh, Onkar, Yeh, Tso-Yang, Wu, Yu-Wei, Su, Emily Chia-Yu
Format: Article
Language:English
Published: JMIR Publications 2020-11-01
Series:JMIR Medical Informatics
Online Access:http://medinform.jmir.org/2020/11/e16503/
id doaj-379605b1d5724ad2bc3cfd5cd6e860fd
record_format Article
spelling doaj-379605b1d5724ad2bc3cfd5cd6e860fd2021-05-03T04:37:35ZengJMIR PublicationsJMIR Medical Informatics2291-96942020-11-01811e1650310.2196/16503Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-AnalysisSufriyana, HerdiantriHusnayain, AtinaChen, Ya-LinKuo, Chao-YangSingh, OnkarYeh, Tso-YangWu, Yu-WeiSu, Emily Chia-Yu BackgroundPredictions in pregnancy care are complex because of interactions among multiple factors. Hence, pregnancy outcomes are not easily predicted by a single predictor using only one algorithm or modeling method. ObjectiveThis study aims to review and compare the predictive performances between logistic regression (LR) and other machine learning algorithms for developing or validating a multivariable prognostic prediction model for pregnancy care to inform clinicians’ decision making. MethodsResearch articles from MEDLINE, Scopus, Web of Science, and Google Scholar were reviewed following several guidelines for a prognostic prediction study, including a risk of bias (ROB) assessment. We report the results based on the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. Studies were primarily framed as PICOTS (population, index, comparator, outcomes, timing, and setting): Population: men or women in procreative management, pregnant women, and fetuses or newborns; Index: multivariable prognostic prediction models using non-LR algorithms for risk classification to inform clinicians’ decision making; Comparator: the models applying an LR; Outcomes: pregnancy-related outcomes of procreation or pregnancy outcomes for pregnant women and fetuses or newborns; Timing: pre-, inter-, and peripregnancy periods (predictors), at the pregnancy, delivery, and either puerperal or neonatal period (outcome), and either short- or long-term prognoses (time interval); and Setting: primary care or hospital. The results were synthesized by reporting study characteristics and ROBs and by random effects modeling of the difference of the logit area under the receiver operating characteristic curve of each non-LR model compared with the LR model for the same pregnancy outcomes. We also reported between-study heterogeneity by using τ2 and I2. ResultsOf the 2093 records, we included 142 studies for the systematic review and 62 studies for a meta-analysis. Most prediction models used LR (92/142, 64.8%) and artificial neural networks (20/142, 14.1%) among non-LR algorithms. Only 16.9% (24/142) of studies had a low ROB. A total of 2 non-LR algorithms from low ROB studies significantly outperformed LR. The first algorithm was a random forest for preterm delivery (logit AUROC 2.51, 95% CI 1.49-3.53; I2=86%; τ2=0.77) and pre-eclampsia (logit AUROC 1.2, 95% CI 0.72-1.67; I2=75%; τ2=0.09). The second algorithm was gradient boosting for cesarean section (logit AUROC 2.26, 95% CI 1.39-3.13; I2=75%; τ2=0.43) and gestational diabetes (logit AUROC 1.03, 95% CI 0.69-1.37; I2=83%; τ2=0.07). ConclusionsPrediction models with the best performances across studies were not necessarily those that used LR but also used random forest and gradient boosting that also performed well. We recommend a reanalysis of existing LR models for several pregnancy outcomes by comparing them with those algorithms that apply standard guidelines. Trial RegistrationPROSPERO (International Prospective Register of Systematic Reviews) CRD42019136106; https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=136106http://medinform.jmir.org/2020/11/e16503/
collection DOAJ
language English
format Article
sources DOAJ
author Sufriyana, Herdiantri
Husnayain, Atina
Chen, Ya-Lin
Kuo, Chao-Yang
Singh, Onkar
Yeh, Tso-Yang
Wu, Yu-Wei
Su, Emily Chia-Yu
spellingShingle Sufriyana, Herdiantri
Husnayain, Atina
Chen, Ya-Lin
Kuo, Chao-Yang
Singh, Onkar
Yeh, Tso-Yang
Wu, Yu-Wei
Su, Emily Chia-Yu
Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis
JMIR Medical Informatics
author_facet Sufriyana, Herdiantri
Husnayain, Atina
Chen, Ya-Lin
Kuo, Chao-Yang
Singh, Onkar
Yeh, Tso-Yang
Wu, Yu-Wei
Su, Emily Chia-Yu
author_sort Sufriyana, Herdiantri
title Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis
title_short Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis
title_full Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis
title_fullStr Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis
title_full_unstemmed Comparison of Multivariable Logistic Regression and Other Machine Learning Algorithms for Prognostic Prediction Studies in Pregnancy Care: Systematic Review and Meta-Analysis
title_sort comparison of multivariable logistic regression and other machine learning algorithms for prognostic prediction studies in pregnancy care: systematic review and meta-analysis
publisher JMIR Publications
series JMIR Medical Informatics
issn 2291-9694
publishDate 2020-11-01
description BackgroundPredictions in pregnancy care are complex because of interactions among multiple factors. Hence, pregnancy outcomes are not easily predicted by a single predictor using only one algorithm or modeling method. ObjectiveThis study aims to review and compare the predictive performances between logistic regression (LR) and other machine learning algorithms for developing or validating a multivariable prognostic prediction model for pregnancy care to inform clinicians’ decision making. MethodsResearch articles from MEDLINE, Scopus, Web of Science, and Google Scholar were reviewed following several guidelines for a prognostic prediction study, including a risk of bias (ROB) assessment. We report the results based on the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines. Studies were primarily framed as PICOTS (population, index, comparator, outcomes, timing, and setting): Population: men or women in procreative management, pregnant women, and fetuses or newborns; Index: multivariable prognostic prediction models using non-LR algorithms for risk classification to inform clinicians’ decision making; Comparator: the models applying an LR; Outcomes: pregnancy-related outcomes of procreation or pregnancy outcomes for pregnant women and fetuses or newborns; Timing: pre-, inter-, and peripregnancy periods (predictors), at the pregnancy, delivery, and either puerperal or neonatal period (outcome), and either short- or long-term prognoses (time interval); and Setting: primary care or hospital. The results were synthesized by reporting study characteristics and ROBs and by random effects modeling of the difference of the logit area under the receiver operating characteristic curve of each non-LR model compared with the LR model for the same pregnancy outcomes. We also reported between-study heterogeneity by using τ2 and I2. ResultsOf the 2093 records, we included 142 studies for the systematic review and 62 studies for a meta-analysis. Most prediction models used LR (92/142, 64.8%) and artificial neural networks (20/142, 14.1%) among non-LR algorithms. Only 16.9% (24/142) of studies had a low ROB. A total of 2 non-LR algorithms from low ROB studies significantly outperformed LR. The first algorithm was a random forest for preterm delivery (logit AUROC 2.51, 95% CI 1.49-3.53; I2=86%; τ2=0.77) and pre-eclampsia (logit AUROC 1.2, 95% CI 0.72-1.67; I2=75%; τ2=0.09). The second algorithm was gradient boosting for cesarean section (logit AUROC 2.26, 95% CI 1.39-3.13; I2=75%; τ2=0.43) and gestational diabetes (logit AUROC 1.03, 95% CI 0.69-1.37; I2=83%; τ2=0.07). ConclusionsPrediction models with the best performances across studies were not necessarily those that used LR but also used random forest and gradient boosting that also performed well. We recommend a reanalysis of existing LR models for several pregnancy outcomes by comparing them with those algorithms that apply standard guidelines. Trial RegistrationPROSPERO (International Prospective Register of Systematic Reviews) CRD42019136106; https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=136106
url http://medinform.jmir.org/2020/11/e16503/
work_keys_str_mv AT sufriyanaherdiantri comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis
AT husnayainatina comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis
AT chenyalin comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis
AT kuochaoyang comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis
AT singhonkar comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis
AT yehtsoyang comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis
AT wuyuwei comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis
AT suemilychiayu comparisonofmultivariablelogisticregressionandothermachinelearningalgorithmsforprognosticpredictionstudiesinpregnancycaresystematicreviewandmetaanalysis
_version_ 1721483740229664768