Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis

Abstract Meta‐analyses often encounter studies with incompletely reported variance measures (e.g., standard deviation values) or sample sizes, both needed to conduct weighted meta‐analyses. Here, we first present a systematic literature survey on the frequency and treatment of missing data in publis...

Full description

Bibliographic Details
Main Authors: Stephan Kambach, Helge Bruelheide, Katharina Gerstner, Jessica Gurevitch, Michael Beckmann, Ralf Seppelt
Format: Article
Language:English
Published: Wiley 2020-10-01
Series:Ecology and Evolution
Subjects:
Online Access:https://doi.org/10.1002/ece3.6806
id doaj-c0d72734a4964cce94cd84282e52f9b2
record_format Article
spelling doaj-c0d72734a4964cce94cd84282e52f9b22021-04-02T19:07:59ZengWileyEcology and Evolution2045-77582020-10-011020116991171210.1002/ece3.6806Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysisStephan Kambach0Helge Bruelheide1Katharina Gerstner2Jessica Gurevitch3Michael Beckmann4Ralf Seppelt5German Centre for Integrative Biodiversity Research (iDiv) Halle‐Jena‐Leipzig Leipzig GermanyInstitute of Biology/Geobotany and Botanical Garden Martin Luther University Halle‐Wittenberg Halle GermanyGerman Centre for Integrative Biodiversity Research (iDiv) Halle‐Jena‐Leipzig Leipzig GermanyDepartment of Ecology and Evolution Stony Brook University Stony Brook NY USADepartment Computational Landscape Ecology UFZ – Helmholtz Centre for Environmental Research Leipzig GermanyDepartment Computational Landscape Ecology UFZ – Helmholtz Centre for Environmental Research Leipzig GermanyAbstract Meta‐analyses often encounter studies with incompletely reported variance measures (e.g., standard deviation values) or sample sizes, both needed to conduct weighted meta‐analyses. Here, we first present a systematic literature survey on the frequency and treatment of missing data in published ecological meta‐analyses showing that the majority of meta‐analyses encountered incompletely reported studies. We then simulated meta‐analysis data sets to investigate the performance of 14 options to treat or impute missing SDs and/or SSs. Performance was thereby assessed using results from fully informed weighted analyses on (hypothetically) complete data sets. We show that the omission of incompletely reported studies is not a viable solution. Unweighted and sample size‐based variance approximation can yield unbiased grand means if effect sizes are independent of their corresponding SDs and SSs. The performance of different imputation methods depends on the structure of the meta‐analysis data set, especially in the case of correlated effect sizes and standard deviations or sample sizes. In a best‐case scenario, which assumes that SDs and/or SSs are both missing at random and are unrelated to effect sizes, our simulations show that the imputation of up to 90% of missing data still yields grand means and confidence intervals that are similar to those obtained with fully informed weighted analyses. We conclude that multiple imputation of missing variance measures and sample sizes could help overcome the problem of incompletely reported primary studies, not only in the field of ecological meta‐analyses. Still, caution must be exercised in consideration of potential correlations and pattern of missingness.https://doi.org/10.1002/ece3.6806effect sizesmissing not at randomrecommendationsresearch synthesissimulated data setsvariance measures
collection DOAJ
language English
format Article
sources DOAJ
author Stephan Kambach
Helge Bruelheide
Katharina Gerstner
Jessica Gurevitch
Michael Beckmann
Ralf Seppelt
spellingShingle Stephan Kambach
Helge Bruelheide
Katharina Gerstner
Jessica Gurevitch
Michael Beckmann
Ralf Seppelt
Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis
Ecology and Evolution
effect sizes
missing not at random
recommendations
research synthesis
simulated data sets
variance measures
author_facet Stephan Kambach
Helge Bruelheide
Katharina Gerstner
Jessica Gurevitch
Michael Beckmann
Ralf Seppelt
author_sort Stephan Kambach
title Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis
title_short Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis
title_full Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis
title_fullStr Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis
title_full_unstemmed Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis
title_sort consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis
publisher Wiley
series Ecology and Evolution
issn 2045-7758
publishDate 2020-10-01
description Abstract Meta‐analyses often encounter studies with incompletely reported variance measures (e.g., standard deviation values) or sample sizes, both needed to conduct weighted meta‐analyses. Here, we first present a systematic literature survey on the frequency and treatment of missing data in published ecological meta‐analyses showing that the majority of meta‐analyses encountered incompletely reported studies. We then simulated meta‐analysis data sets to investigate the performance of 14 options to treat or impute missing SDs and/or SSs. Performance was thereby assessed using results from fully informed weighted analyses on (hypothetically) complete data sets. We show that the omission of incompletely reported studies is not a viable solution. Unweighted and sample size‐based variance approximation can yield unbiased grand means if effect sizes are independent of their corresponding SDs and SSs. The performance of different imputation methods depends on the structure of the meta‐analysis data set, especially in the case of correlated effect sizes and standard deviations or sample sizes. In a best‐case scenario, which assumes that SDs and/or SSs are both missing at random and are unrelated to effect sizes, our simulations show that the imputation of up to 90% of missing data still yields grand means and confidence intervals that are similar to those obtained with fully informed weighted analyses. We conclude that multiple imputation of missing variance measures and sample sizes could help overcome the problem of incompletely reported primary studies, not only in the field of ecological meta‐analyses. Still, caution must be exercised in consideration of potential correlations and pattern of missingness.
topic effect sizes
missing not at random
recommendations
research synthesis
simulated data sets
variance measures
url https://doi.org/10.1002/ece3.6806
work_keys_str_mv AT stephankambach consequencesofmultipleimputationofmissingstandarddeviationsandsamplesizesinmetaanalysis
AT helgebruelheide consequencesofmultipleimputationofmissingstandarddeviationsandsamplesizesinmetaanalysis
AT katharinagerstner consequencesofmultipleimputationofmissingstandarddeviationsandsamplesizesinmetaanalysis
AT jessicagurevitch consequencesofmultipleimputationofmissingstandarddeviationsandsamplesizesinmetaanalysis
AT michaelbeckmann consequencesofmultipleimputationofmissingstandarddeviationsandsamplesizesinmetaanalysis
AT ralfseppelt consequencesofmultipleimputationofmissingstandarddeviationsandsamplesizesinmetaanalysis
_version_ 1721549548373934080