Ensemble bootstrap methodology for forecasting dynamic growth processes using differential equations: application to epidemic outbreaks

Abstract Background Ensemble modeling aims to boost the forecasting performance by systematically integrating the predictive accuracy across individual models. Here we introduce a simple-yet-powerful ensemble methodology for forecasting the trajectory of dynamic growth processes that are defined by...

Full description

Bibliographic Details
Main Authors: Gerardo Chowell, Ruiyan Luo
Format: Article
Language:English
Published: BMC 2021-02-01
Series:BMC Medical Research Methodology
Subjects:
Online Access:https://doi.org/10.1186/s12874-021-01226-9
id doaj-9f36272221b04c2299882bfefa40b4fb
record_format Article
spelling doaj-9f36272221b04c2299882bfefa40b4fb2021-02-21T12:03:05ZengBMCBMC Medical Research Methodology1471-22882021-02-0121111810.1186/s12874-021-01226-9Ensemble bootstrap methodology for forecasting dynamic growth processes using differential equations: application to epidemic outbreaksGerardo Chowell0Ruiyan Luo1Department of Population Heath Sciences, School of Public Health, Georgia State UniversityDepartment of Population Heath Sciences, School of Public Health, Georgia State UniversityAbstract Background Ensemble modeling aims to boost the forecasting performance by systematically integrating the predictive accuracy across individual models. Here we introduce a simple-yet-powerful ensemble methodology for forecasting the trajectory of dynamic growth processes that are defined by a system of non-linear differential equations with applications to infectious disease spread. Methods We propose and assess the performance of two ensemble modeling schemes with different parametric bootstrapping procedures for trajectory forecasting and uncertainty quantification. Specifically, we conduct sequential probabilistic forecasts to evaluate their forecasting performance using simple dynamical growth models with good track records including the Richards model, the generalized-logistic growth model, and the Gompertz model. We first test and verify the functionality of the method using simulated data from phenomenological models and a mechanistic transmission model. Next, the performance of the method is demonstrated using a diversity of epidemic datasets including scenario outbreak data of the Ebola Forecasting Challenge and real-world epidemic data outbreaks of including influenza, plague, Zika, and COVID-19. Results We found that the ensemble method that randomly selects a model from the set of individual models for each time point of the trajectory of the epidemic frequently outcompeted the individual models as well as an alternative ensemble method based on the weighted combination of the individual models and yields broader and more realistic uncertainty bounds for the trajectory envelope, achieving not only better coverage rate of the 95% prediction interval but also improved mean interval scores across a diversity of epidemic datasets. Conclusion Our new methodology for ensemble forecasting outcompete component models and an alternative ensemble model that differ in how the variance is evaluated for the generation of the prediction intervals of the forecasts.https://doi.org/10.1186/s12874-021-01226-9Model ensemble, parameter estimation, uncertainty quantification, phenomenological growthDifferential equationsGeneralized logistic growth modelRichards modelGompertz modelInterval score
collection DOAJ
language English
format Article
sources DOAJ
author Gerardo Chowell
Ruiyan Luo
spellingShingle Gerardo Chowell
Ruiyan Luo
Ensemble bootstrap methodology for forecasting dynamic growth processes using differential equations: application to epidemic outbreaks
BMC Medical Research Methodology
Model ensemble, parameter estimation, uncertainty quantification, phenomenological growth
Differential equations
Generalized logistic growth model
Richards model
Gompertz model
Interval score
author_facet Gerardo Chowell
Ruiyan Luo
author_sort Gerardo Chowell
title Ensemble bootstrap methodology for forecasting dynamic growth processes using differential equations: application to epidemic outbreaks
title_short Ensemble bootstrap methodology for forecasting dynamic growth processes using differential equations: application to epidemic outbreaks
title_full Ensemble bootstrap methodology for forecasting dynamic growth processes using differential equations: application to epidemic outbreaks
title_fullStr Ensemble bootstrap methodology for forecasting dynamic growth processes using differential equations: application to epidemic outbreaks
title_full_unstemmed Ensemble bootstrap methodology for forecasting dynamic growth processes using differential equations: application to epidemic outbreaks
title_sort ensemble bootstrap methodology for forecasting dynamic growth processes using differential equations: application to epidemic outbreaks
publisher BMC
series BMC Medical Research Methodology
issn 1471-2288
publishDate 2021-02-01
description Abstract Background Ensemble modeling aims to boost the forecasting performance by systematically integrating the predictive accuracy across individual models. Here we introduce a simple-yet-powerful ensemble methodology for forecasting the trajectory of dynamic growth processes that are defined by a system of non-linear differential equations with applications to infectious disease spread. Methods We propose and assess the performance of two ensemble modeling schemes with different parametric bootstrapping procedures for trajectory forecasting and uncertainty quantification. Specifically, we conduct sequential probabilistic forecasts to evaluate their forecasting performance using simple dynamical growth models with good track records including the Richards model, the generalized-logistic growth model, and the Gompertz model. We first test and verify the functionality of the method using simulated data from phenomenological models and a mechanistic transmission model. Next, the performance of the method is demonstrated using a diversity of epidemic datasets including scenario outbreak data of the Ebola Forecasting Challenge and real-world epidemic data outbreaks of including influenza, plague, Zika, and COVID-19. Results We found that the ensemble method that randomly selects a model from the set of individual models for each time point of the trajectory of the epidemic frequently outcompeted the individual models as well as an alternative ensemble method based on the weighted combination of the individual models and yields broader and more realistic uncertainty bounds for the trajectory envelope, achieving not only better coverage rate of the 95% prediction interval but also improved mean interval scores across a diversity of epidemic datasets. Conclusion Our new methodology for ensemble forecasting outcompete component models and an alternative ensemble model that differ in how the variance is evaluated for the generation of the prediction intervals of the forecasts.
topic Model ensemble, parameter estimation, uncertainty quantification, phenomenological growth
Differential equations
Generalized logistic growth model
Richards model
Gompertz model
Interval score
url https://doi.org/10.1186/s12874-021-01226-9
work_keys_str_mv AT gerardochowell ensemblebootstrapmethodologyforforecastingdynamicgrowthprocessesusingdifferentialequationsapplicationtoepidemicoutbreaks
AT ruiyanluo ensemblebootstrapmethodologyforforecastingdynamicgrowthprocessesusingdifferentialequationsapplicationtoepidemicoutbreaks
_version_ 1724258534028214272