Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis
Abstract Meta‐analyses often encounter studies with incompletely reported variance measures (e.g., standard deviation values) or sample sizes, both needed to conduct weighted meta‐analyses. Here, we first present a systematic literature survey on the frequency and treatment of missing data in publis...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2020-10-01
|
Series: | Ecology and Evolution |
Subjects: | |
Online Access: | https://doi.org/10.1002/ece3.6806 |
id |
doaj-c0d72734a4964cce94cd84282e52f9b2 |
---|---|
record_format |
Article |
spelling |
doaj-c0d72734a4964cce94cd84282e52f9b22021-04-02T19:07:59ZengWileyEcology and Evolution2045-77582020-10-011020116991171210.1002/ece3.6806Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysisStephan Kambach0Helge Bruelheide1Katharina Gerstner2Jessica Gurevitch3Michael Beckmann4Ralf Seppelt5German Centre for Integrative Biodiversity Research (iDiv) Halle‐Jena‐Leipzig Leipzig GermanyInstitute of Biology/Geobotany and Botanical Garden Martin Luther University Halle‐Wittenberg Halle GermanyGerman Centre for Integrative Biodiversity Research (iDiv) Halle‐Jena‐Leipzig Leipzig GermanyDepartment of Ecology and Evolution Stony Brook University Stony Brook NY USADepartment Computational Landscape Ecology UFZ – Helmholtz Centre for Environmental Research Leipzig GermanyDepartment Computational Landscape Ecology UFZ – Helmholtz Centre for Environmental Research Leipzig GermanyAbstract Meta‐analyses often encounter studies with incompletely reported variance measures (e.g., standard deviation values) or sample sizes, both needed to conduct weighted meta‐analyses. Here, we first present a systematic literature survey on the frequency and treatment of missing data in published ecological meta‐analyses showing that the majority of meta‐analyses encountered incompletely reported studies. We then simulated meta‐analysis data sets to investigate the performance of 14 options to treat or impute missing SDs and/or SSs. Performance was thereby assessed using results from fully informed weighted analyses on (hypothetically) complete data sets. We show that the omission of incompletely reported studies is not a viable solution. Unweighted and sample size‐based variance approximation can yield unbiased grand means if effect sizes are independent of their corresponding SDs and SSs. The performance of different imputation methods depends on the structure of the meta‐analysis data set, especially in the case of correlated effect sizes and standard deviations or sample sizes. In a best‐case scenario, which assumes that SDs and/or SSs are both missing at random and are unrelated to effect sizes, our simulations show that the imputation of up to 90% of missing data still yields grand means and confidence intervals that are similar to those obtained with fully informed weighted analyses. We conclude that multiple imputation of missing variance measures and sample sizes could help overcome the problem of incompletely reported primary studies, not only in the field of ecological meta‐analyses. Still, caution must be exercised in consideration of potential correlations and pattern of missingness.https://doi.org/10.1002/ece3.6806effect sizesmissing not at randomrecommendationsresearch synthesissimulated data setsvariance measures |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Stephan Kambach Helge Bruelheide Katharina Gerstner Jessica Gurevitch Michael Beckmann Ralf Seppelt |
spellingShingle |
Stephan Kambach Helge Bruelheide Katharina Gerstner Jessica Gurevitch Michael Beckmann Ralf Seppelt Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis Ecology and Evolution effect sizes missing not at random recommendations research synthesis simulated data sets variance measures |
author_facet |
Stephan Kambach Helge Bruelheide Katharina Gerstner Jessica Gurevitch Michael Beckmann Ralf Seppelt |
author_sort |
Stephan Kambach |
title |
Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis |
title_short |
Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis |
title_full |
Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis |
title_fullStr |
Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis |
title_full_unstemmed |
Consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis |
title_sort |
consequences of multiple imputation of missing standard deviations and sample sizes in meta‐analysis |
publisher |
Wiley |
series |
Ecology and Evolution |
issn |
2045-7758 |
publishDate |
2020-10-01 |
description |
Abstract Meta‐analyses often encounter studies with incompletely reported variance measures (e.g., standard deviation values) or sample sizes, both needed to conduct weighted meta‐analyses. Here, we first present a systematic literature survey on the frequency and treatment of missing data in published ecological meta‐analyses showing that the majority of meta‐analyses encountered incompletely reported studies. We then simulated meta‐analysis data sets to investigate the performance of 14 options to treat or impute missing SDs and/or SSs. Performance was thereby assessed using results from fully informed weighted analyses on (hypothetically) complete data sets. We show that the omission of incompletely reported studies is not a viable solution. Unweighted and sample size‐based variance approximation can yield unbiased grand means if effect sizes are independent of their corresponding SDs and SSs. The performance of different imputation methods depends on the structure of the meta‐analysis data set, especially in the case of correlated effect sizes and standard deviations or sample sizes. In a best‐case scenario, which assumes that SDs and/or SSs are both missing at random and are unrelated to effect sizes, our simulations show that the imputation of up to 90% of missing data still yields grand means and confidence intervals that are similar to those obtained with fully informed weighted analyses. We conclude that multiple imputation of missing variance measures and sample sizes could help overcome the problem of incompletely reported primary studies, not only in the field of ecological meta‐analyses. Still, caution must be exercised in consideration of potential correlations and pattern of missingness. |
topic |
effect sizes missing not at random recommendations research synthesis simulated data sets variance measures |
url |
https://doi.org/10.1002/ece3.6806 |
work_keys_str_mv |
AT stephankambach consequencesofmultipleimputationofmissingstandarddeviationsandsamplesizesinmetaanalysis AT helgebruelheide consequencesofmultipleimputationofmissingstandarddeviationsandsamplesizesinmetaanalysis AT katharinagerstner consequencesofmultipleimputationofmissingstandarddeviationsandsamplesizesinmetaanalysis AT jessicagurevitch consequencesofmultipleimputationofmissingstandarddeviationsandsamplesizesinmetaanalysis AT michaelbeckmann consequencesofmultipleimputationofmissingstandarddeviationsandsamplesizesinmetaanalysis AT ralfseppelt consequencesofmultipleimputationofmissingstandarddeviationsandsamplesizesinmetaanalysis |
_version_ |
1721549548373934080 |