Predictive inference with random forests: A new perspective on classical analyses
Despite the number of problems that can occur when core model assumptions are violated, nearly all quantitative political science research relies on inflexible regression models that require a linear relationship between dependent and independent variables for valid inference. We argue that nonparam...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
SAGE Publishing
2020-02-01
|
Series: | Research & Politics |
Online Access: | https://doi.org/10.1177/2053168020905487 |
id |
doaj-9c338c60ff2d450292c9701be83a5f20 |
---|---|
record_format |
Article |
spelling |
doaj-9c338c60ff2d450292c9701be83a5f202020-11-25T03:56:12ZengSAGE PublishingResearch & Politics2053-16802020-02-01710.1177/2053168020905487Predictive inference with random forests: A new perspective on classical analysesRichard J. McAlexander0Lucas Mentch1Columbia University, USAUniversity of Pittsburgh, USADespite the number of problems that can occur when core model assumptions are violated, nearly all quantitative political science research relies on inflexible regression models that require a linear relationship between dependent and independent variables for valid inference. We argue that nonparametric statistical learning methods like random forests are capable of combining the benefits of interpretability and flexibility. Recent work has shown that under suitable regularity conditions, averaging over predictions made by subsampled random forests produces asymptotically normal predictions. After estimating the variance, this property can be exploited to produce hypothesis tests and confidence intervals analogous to those produced within a parametric framework. We demonstrated the utility of this approach by replicating an important study on the determinants of civil war onset and show that subtle nonlinear relationships are uncovered, providing a new perspective on these ongoing research questions.https://doi.org/10.1177/2053168020905487 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Richard J. McAlexander Lucas Mentch |
spellingShingle |
Richard J. McAlexander Lucas Mentch Predictive inference with random forests: A new perspective on classical analyses Research & Politics |
author_facet |
Richard J. McAlexander Lucas Mentch |
author_sort |
Richard J. McAlexander |
title |
Predictive inference with random forests: A new perspective on classical analyses |
title_short |
Predictive inference with random forests: A new perspective on classical analyses |
title_full |
Predictive inference with random forests: A new perspective on classical analyses |
title_fullStr |
Predictive inference with random forests: A new perspective on classical analyses |
title_full_unstemmed |
Predictive inference with random forests: A new perspective on classical analyses |
title_sort |
predictive inference with random forests: a new perspective on classical analyses |
publisher |
SAGE Publishing |
series |
Research & Politics |
issn |
2053-1680 |
publishDate |
2020-02-01 |
description |
Despite the number of problems that can occur when core model assumptions are violated, nearly all quantitative political science research relies on inflexible regression models that require a linear relationship between dependent and independent variables for valid inference. We argue that nonparametric statistical learning methods like random forests are capable of combining the benefits of interpretability and flexibility. Recent work has shown that under suitable regularity conditions, averaging over predictions made by subsampled random forests produces asymptotically normal predictions. After estimating the variance, this property can be exploited to produce hypothesis tests and confidence intervals analogous to those produced within a parametric framework. We demonstrated the utility of this approach by replicating an important study on the determinants of civil war onset and show that subtle nonlinear relationships are uncovered, providing a new perspective on these ongoing research questions. |
url |
https://doi.org/10.1177/2053168020905487 |
work_keys_str_mv |
AT richardjmcalexander predictiveinferencewithrandomforestsanewperspectiveonclassicalanalyses AT lucasmentch predictiveinferencewithrandomforestsanewperspectiveonclassicalanalyses |
_version_ |
1724466549165654016 |