Prediction models for clustered data with informative priors for the random effects: a simulation study

Abstract Background Random effects modelling is routinely used in clustered data, but for prediction models, random effects are commonly substituted with the mean zero after model development. In this study, we proposed a novel approach of including prior knowledge through the random effects distrib...

Full description

Bibliographic Details
Main Authors: Haifang Ni, Rolf H. H. Groenwold, Mirjam Nielen, Irene Klugkist
Format: Article
Language:English
Published: BMC 2018-08-01
Series:BMC Medical Research Methodology
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12874-018-0543-5
id doaj-e1d5c23adce04684b57bfc51d5843193
record_format Article
spelling doaj-e1d5c23adce04684b57bfc51d58431932020-11-25T01:38:01ZengBMCBMC Medical Research Methodology1471-22882018-08-0118111010.1186/s12874-018-0543-5Prediction models for clustered data with informative priors for the random effects: a simulation studyHaifang Ni0Rolf H. H. Groenwold1Mirjam Nielen2Irene Klugkist3Department of Methodology and Statistics, Faculty of Social and Behavioral Sciences, Utrecht UniversityDepartment of Epidemiology, Julius Center for Health Sciences and Primary Care, University Medical Center UtrechtDepartment of Farm Animal Health, Faculty of Veterinary Medicine, Utrecht UniversityDepartment of Methodology and Statistics, Faculty of Social and Behavioral Sciences, Utrecht UniversityAbstract Background Random effects modelling is routinely used in clustered data, but for prediction models, random effects are commonly substituted with the mean zero after model development. In this study, we proposed a novel approach of including prior knowledge through the random effects distribution and investigated to what extent this could improve the predictive performance. Methods Data were simulated on the basis of a random effects logistic regression model. Five prediction models were specified: a frequentist model that set the random effects to zero for all new clusters, a Bayesian model with weakly informative priors for the random effects of new clusters, Bayesian models with expert opinion incorporated into low informative, medium informative and highly informative priors for the random effects. Expert opinion at the cluster level was elicited in the form of a truncated area of the random effects distribution. The predictive performance of the five models was assessed. In addition, impact of suboptimal expert opinion that deviated from the true quantity as well as including expert opinion by means of a categorical variable in the frequentist approach were explored. The five models were further investigated in various sensitivity analyses. Results The Bayesian prediction model using weakly informative priors for the random effects showed similar results to the frequentist model. Bayesian prediction models using expert opinion as informative priors showed smaller Brier scores, better overall discrimination and calibration, as well as better within cluster calibration. Results also indicated that incorporation of more precise expert opinion led to better predictions. Predictive performance from the frequentist models with expert opinion incorporated as categorical variable showed similar patterns as the Bayesian models with informative priors. When suboptimal expert opinion was used as prior information, results indicated that prediction still improved in certain settings. Conclusions The prediction models that incorporated cluster level information showed better performance than the models that did not. The Bayesian prediction models we proposed, with cluster specific expert opinion incorporated as priors for the random effects showed better predictive ability in new data, compared to the frequentist method that replaced random effects with zero after model development.http://link.springer.com/article/10.1186/s12874-018-0543-5Random effects prediction modelClustered dataInformative priors for the random effectsExpert knowledgeTruncated distribution
collection DOAJ
language English
format Article
sources DOAJ
author Haifang Ni
Rolf H. H. Groenwold
Mirjam Nielen
Irene Klugkist
spellingShingle Haifang Ni
Rolf H. H. Groenwold
Mirjam Nielen
Irene Klugkist
Prediction models for clustered data with informative priors for the random effects: a simulation study
BMC Medical Research Methodology
Random effects prediction model
Clustered data
Informative priors for the random effects
Expert knowledge
Truncated distribution
author_facet Haifang Ni
Rolf H. H. Groenwold
Mirjam Nielen
Irene Klugkist
author_sort Haifang Ni
title Prediction models for clustered data with informative priors for the random effects: a simulation study
title_short Prediction models for clustered data with informative priors for the random effects: a simulation study
title_full Prediction models for clustered data with informative priors for the random effects: a simulation study
title_fullStr Prediction models for clustered data with informative priors for the random effects: a simulation study
title_full_unstemmed Prediction models for clustered data with informative priors for the random effects: a simulation study
title_sort prediction models for clustered data with informative priors for the random effects: a simulation study
publisher BMC
series BMC Medical Research Methodology
issn 1471-2288
publishDate 2018-08-01
description Abstract Background Random effects modelling is routinely used in clustered data, but for prediction models, random effects are commonly substituted with the mean zero after model development. In this study, we proposed a novel approach of including prior knowledge through the random effects distribution and investigated to what extent this could improve the predictive performance. Methods Data were simulated on the basis of a random effects logistic regression model. Five prediction models were specified: a frequentist model that set the random effects to zero for all new clusters, a Bayesian model with weakly informative priors for the random effects of new clusters, Bayesian models with expert opinion incorporated into low informative, medium informative and highly informative priors for the random effects. Expert opinion at the cluster level was elicited in the form of a truncated area of the random effects distribution. The predictive performance of the five models was assessed. In addition, impact of suboptimal expert opinion that deviated from the true quantity as well as including expert opinion by means of a categorical variable in the frequentist approach were explored. The five models were further investigated in various sensitivity analyses. Results The Bayesian prediction model using weakly informative priors for the random effects showed similar results to the frequentist model. Bayesian prediction models using expert opinion as informative priors showed smaller Brier scores, better overall discrimination and calibration, as well as better within cluster calibration. Results also indicated that incorporation of more precise expert opinion led to better predictions. Predictive performance from the frequentist models with expert opinion incorporated as categorical variable showed similar patterns as the Bayesian models with informative priors. When suboptimal expert opinion was used as prior information, results indicated that prediction still improved in certain settings. Conclusions The prediction models that incorporated cluster level information showed better performance than the models that did not. The Bayesian prediction models we proposed, with cluster specific expert opinion incorporated as priors for the random effects showed better predictive ability in new data, compared to the frequentist method that replaced random effects with zero after model development.
topic Random effects prediction model
Clustered data
Informative priors for the random effects
Expert knowledge
Truncated distribution
url http://link.springer.com/article/10.1186/s12874-018-0543-5
work_keys_str_mv AT haifangni predictionmodelsforclustereddatawithinformativepriorsfortherandomeffectsasimulationstudy
AT rolfhhgroenwold predictionmodelsforclustereddatawithinformativepriorsfortherandomeffectsasimulationstudy
AT mirjamnielen predictionmodelsforclustereddatawithinformativepriorsfortherandomeffectsasimulationstudy
AT ireneklugkist predictionmodelsforclustereddatawithinformativepriorsfortherandomeffectsasimulationstudy
_version_ 1725055621243338752