Predicting the replicability of social science lab experiments.
We measure how accurately replication of experimental results can be predicted by black-box statistical models. With data from four large-scale replication projects in experimental psychology and economics, and techniques from machine learning, we train predictive models and study which variables dr...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2019-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0225826 |
id |
doaj-8bd9cacff3e64798b9387e61ea2df3c2 |
---|---|
record_format |
Article |
spelling |
doaj-8bd9cacff3e64798b9387e61ea2df3c22021-03-03T21:16:19ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-011412e022582610.1371/journal.pone.0225826Predicting the replicability of social science lab experiments.Adam AltmejdAnna DreberEskil ForsellJuergen HuberTaisuke ImaiMagnus JohannessonMichael KirchlerGideon NaveColin CamererWe measure how accurately replication of experimental results can be predicted by black-box statistical models. With data from four large-scale replication projects in experimental psychology and economics, and techniques from machine learning, we train predictive models and study which variables drive predictable replication. The models predicts binary replication with a cross-validated accuracy rate of 70% (AUC of 0.77) and estimates of relative effect sizes with a Spearman ρ of 0.38. The accuracy level is similar to market-aggregated beliefs of peer scientists [1, 2]. The predictive power is validated in a pre-registered out of sample test of the outcome of [3], where 71% (AUC of 0.73) of replications are predicted correctly and effect size correlations amount to ρ = 0.25. Basic features such as the sample and effect sizes in original papers, and whether reported effects are single-variable main effects or two-variable interactions, are predictive of successful replication. The models presented in this paper are simple tools to produce cheap, prognostic replicability metrics. These models could be useful in institutionalizing the process of evaluation of new findings and guiding resources to those direct replications that are likely to be most informative.https://doi.org/10.1371/journal.pone.0225826 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Adam Altmejd Anna Dreber Eskil Forsell Juergen Huber Taisuke Imai Magnus Johannesson Michael Kirchler Gideon Nave Colin Camerer |
spellingShingle |
Adam Altmejd Anna Dreber Eskil Forsell Juergen Huber Taisuke Imai Magnus Johannesson Michael Kirchler Gideon Nave Colin Camerer Predicting the replicability of social science lab experiments. PLoS ONE |
author_facet |
Adam Altmejd Anna Dreber Eskil Forsell Juergen Huber Taisuke Imai Magnus Johannesson Michael Kirchler Gideon Nave Colin Camerer |
author_sort |
Adam Altmejd |
title |
Predicting the replicability of social science lab experiments. |
title_short |
Predicting the replicability of social science lab experiments. |
title_full |
Predicting the replicability of social science lab experiments. |
title_fullStr |
Predicting the replicability of social science lab experiments. |
title_full_unstemmed |
Predicting the replicability of social science lab experiments. |
title_sort |
predicting the replicability of social science lab experiments. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2019-01-01 |
description |
We measure how accurately replication of experimental results can be predicted by black-box statistical models. With data from four large-scale replication projects in experimental psychology and economics, and techniques from machine learning, we train predictive models and study which variables drive predictable replication. The models predicts binary replication with a cross-validated accuracy rate of 70% (AUC of 0.77) and estimates of relative effect sizes with a Spearman ρ of 0.38. The accuracy level is similar to market-aggregated beliefs of peer scientists [1, 2]. The predictive power is validated in a pre-registered out of sample test of the outcome of [3], where 71% (AUC of 0.73) of replications are predicted correctly and effect size correlations amount to ρ = 0.25. Basic features such as the sample and effect sizes in original papers, and whether reported effects are single-variable main effects or two-variable interactions, are predictive of successful replication. The models presented in this paper are simple tools to produce cheap, prognostic replicability metrics. These models could be useful in institutionalizing the process of evaluation of new findings and guiding resources to those direct replications that are likely to be most informative. |
url |
https://doi.org/10.1371/journal.pone.0225826 |
work_keys_str_mv |
AT adamaltmejd predictingthereplicabilityofsocialsciencelabexperiments AT annadreber predictingthereplicabilityofsocialsciencelabexperiments AT eskilforsell predictingthereplicabilityofsocialsciencelabexperiments AT juergenhuber predictingthereplicabilityofsocialsciencelabexperiments AT taisukeimai predictingthereplicabilityofsocialsciencelabexperiments AT magnusjohannesson predictingthereplicabilityofsocialsciencelabexperiments AT michaelkirchler predictingthereplicabilityofsocialsciencelabexperiments AT gideonnave predictingthereplicabilityofsocialsciencelabexperiments AT colincamerer predictingthereplicabilityofsocialsciencelabexperiments |
_version_ |
1714817841274814464 |