Predicting the replicability of social science lab experiments.

We measure how accurately replication of experimental results can be predicted by black-box statistical models. With data from four large-scale replication projects in experimental psychology and economics, and techniques from machine learning, we train predictive models and study which variables dr...

Full description

Bibliographic Details
Main Authors: Adam Altmejd, Anna Dreber, Eskil Forsell, Juergen Huber, Taisuke Imai, Magnus Johannesson, Michael Kirchler, Gideon Nave, Colin Camerer
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0225826
id doaj-8bd9cacff3e64798b9387e61ea2df3c2
record_format Article
spelling doaj-8bd9cacff3e64798b9387e61ea2df3c22021-03-03T21:16:19ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-011412e022582610.1371/journal.pone.0225826Predicting the replicability of social science lab experiments.Adam AltmejdAnna DreberEskil ForsellJuergen HuberTaisuke ImaiMagnus JohannessonMichael KirchlerGideon NaveColin CamererWe measure how accurately replication of experimental results can be predicted by black-box statistical models. With data from four large-scale replication projects in experimental psychology and economics, and techniques from machine learning, we train predictive models and study which variables drive predictable replication. The models predicts binary replication with a cross-validated accuracy rate of 70% (AUC of 0.77) and estimates of relative effect sizes with a Spearman ρ of 0.38. The accuracy level is similar to market-aggregated beliefs of peer scientists [1, 2]. The predictive power is validated in a pre-registered out of sample test of the outcome of [3], where 71% (AUC of 0.73) of replications are predicted correctly and effect size correlations amount to ρ = 0.25. Basic features such as the sample and effect sizes in original papers, and whether reported effects are single-variable main effects or two-variable interactions, are predictive of successful replication. The models presented in this paper are simple tools to produce cheap, prognostic replicability metrics. These models could be useful in institutionalizing the process of evaluation of new findings and guiding resources to those direct replications that are likely to be most informative.https://doi.org/10.1371/journal.pone.0225826
collection DOAJ
language English
format Article
sources DOAJ
author Adam Altmejd
Anna Dreber
Eskil Forsell
Juergen Huber
Taisuke Imai
Magnus Johannesson
Michael Kirchler
Gideon Nave
Colin Camerer
spellingShingle Adam Altmejd
Anna Dreber
Eskil Forsell
Juergen Huber
Taisuke Imai
Magnus Johannesson
Michael Kirchler
Gideon Nave
Colin Camerer
Predicting the replicability of social science lab experiments.
PLoS ONE
author_facet Adam Altmejd
Anna Dreber
Eskil Forsell
Juergen Huber
Taisuke Imai
Magnus Johannesson
Michael Kirchler
Gideon Nave
Colin Camerer
author_sort Adam Altmejd
title Predicting the replicability of social science lab experiments.
title_short Predicting the replicability of social science lab experiments.
title_full Predicting the replicability of social science lab experiments.
title_fullStr Predicting the replicability of social science lab experiments.
title_full_unstemmed Predicting the replicability of social science lab experiments.
title_sort predicting the replicability of social science lab experiments.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2019-01-01
description We measure how accurately replication of experimental results can be predicted by black-box statistical models. With data from four large-scale replication projects in experimental psychology and economics, and techniques from machine learning, we train predictive models and study which variables drive predictable replication. The models predicts binary replication with a cross-validated accuracy rate of 70% (AUC of 0.77) and estimates of relative effect sizes with a Spearman ρ of 0.38. The accuracy level is similar to market-aggregated beliefs of peer scientists [1, 2]. The predictive power is validated in a pre-registered out of sample test of the outcome of [3], where 71% (AUC of 0.73) of replications are predicted correctly and effect size correlations amount to ρ = 0.25. Basic features such as the sample and effect sizes in original papers, and whether reported effects are single-variable main effects or two-variable interactions, are predictive of successful replication. The models presented in this paper are simple tools to produce cheap, prognostic replicability metrics. These models could be useful in institutionalizing the process of evaluation of new findings and guiding resources to those direct replications that are likely to be most informative.
url https://doi.org/10.1371/journal.pone.0225826
work_keys_str_mv AT adamaltmejd predictingthereplicabilityofsocialsciencelabexperiments
AT annadreber predictingthereplicabilityofsocialsciencelabexperiments
AT eskilforsell predictingthereplicabilityofsocialsciencelabexperiments
AT juergenhuber predictingthereplicabilityofsocialsciencelabexperiments
AT taisukeimai predictingthereplicabilityofsocialsciencelabexperiments
AT magnusjohannesson predictingthereplicabilityofsocialsciencelabexperiments
AT michaelkirchler predictingthereplicabilityofsocialsciencelabexperiments
AT gideonnave predictingthereplicabilityofsocialsciencelabexperiments
AT colincamerer predictingthereplicabilityofsocialsciencelabexperiments
_version_ 1714817841274814464