A Boosting Algorithm for Estimating Generalized Propensity Scores with Continuous Treatments

In this article, we study the causal inference problem with a continuous treatment variable using propensity score-based methods. For a continuous treatment, the generalized propensity score is defined as the conditional density of the treatment-level given covariates (confounders). The dose–respons...

Full description

Bibliographic Details
Main Authors: Zhu Yeying, Coffman Donna L., Ghosh Debashis
Format: Article
Language:English
Published: De Gruyter 2015-03-01
Series:Journal of Causal Inference
Subjects:
Online Access:https://doi.org/10.1515/jci-2014-0022
id doaj-d2ed925bc34d4eb6a62a710cb3929d56
record_format Article
spelling doaj-d2ed925bc34d4eb6a62a710cb3929d562021-09-06T19:40:28ZengDe GruyterJournal of Causal Inference2193-36772193-36852015-03-0131254010.1515/jci-2014-0022A Boosting Algorithm for Estimating Generalized Propensity Scores with Continuous TreatmentsZhu Yeying0Coffman Donna L.1Ghosh Debashis2Department of Statistics and Actuarial Science, University of Waterloo, 200 University Ave W, Waterloo, ON N2L 3G1, CanadaThe Methodology Center, The Pennsylvania State University, University Park, PA, USADepartment of Statistics and Public Health Sciences, The Pennsylvania State University, University Park, PA, USAIn this article, we study the causal inference problem with a continuous treatment variable using propensity score-based methods. For a continuous treatment, the generalized propensity score is defined as the conditional density of the treatment-level given covariates (confounders). The dose–response function is then estimated by inverse probability weighting, where the weights are calculated from the estimated propensity scores. When the dimension of the covariates is large, the traditional nonparametric density estimation suffers from the curse of dimensionality. Some researchers have suggested a two-step estimation procedure by first modeling the mean function. In this study, we suggest a boosting algorithm to estimate the mean function of the treatment given covariates. In boosting, an important tuning parameter is the number of trees to be generated, which essentially determines the trade-off between bias and variance of the causal estimator. We propose a criterion called average absolute correlation coefficient (AACC) to determine the optimal number of trees. Simulation results show that the proposed approach performs better than a simple linear approximation or L2 boosting. The proposed methodology is also illustrated through the Early Dieting in Girls study, which examines the influence of mothers’ overall weight concern on daughters’ dieting behavior.https://doi.org/10.1515/jci-2014-0022boostingdistance correlationdose–response functiongeneralized propensity scoreshigh dimensional
collection DOAJ
language English
format Article
sources DOAJ
author Zhu Yeying
Coffman Donna L.
Ghosh Debashis
spellingShingle Zhu Yeying
Coffman Donna L.
Ghosh Debashis
A Boosting Algorithm for Estimating Generalized Propensity Scores with Continuous Treatments
Journal of Causal Inference
boosting
distance correlation
dose–response function
generalized propensity scores
high dimensional
author_facet Zhu Yeying
Coffman Donna L.
Ghosh Debashis
author_sort Zhu Yeying
title A Boosting Algorithm for Estimating Generalized Propensity Scores with Continuous Treatments
title_short A Boosting Algorithm for Estimating Generalized Propensity Scores with Continuous Treatments
title_full A Boosting Algorithm for Estimating Generalized Propensity Scores with Continuous Treatments
title_fullStr A Boosting Algorithm for Estimating Generalized Propensity Scores with Continuous Treatments
title_full_unstemmed A Boosting Algorithm for Estimating Generalized Propensity Scores with Continuous Treatments
title_sort boosting algorithm for estimating generalized propensity scores with continuous treatments
publisher De Gruyter
series Journal of Causal Inference
issn 2193-3677
2193-3685
publishDate 2015-03-01
description In this article, we study the causal inference problem with a continuous treatment variable using propensity score-based methods. For a continuous treatment, the generalized propensity score is defined as the conditional density of the treatment-level given covariates (confounders). The dose–response function is then estimated by inverse probability weighting, where the weights are calculated from the estimated propensity scores. When the dimension of the covariates is large, the traditional nonparametric density estimation suffers from the curse of dimensionality. Some researchers have suggested a two-step estimation procedure by first modeling the mean function. In this study, we suggest a boosting algorithm to estimate the mean function of the treatment given covariates. In boosting, an important tuning parameter is the number of trees to be generated, which essentially determines the trade-off between bias and variance of the causal estimator. We propose a criterion called average absolute correlation coefficient (AACC) to determine the optimal number of trees. Simulation results show that the proposed approach performs better than a simple linear approximation or L2 boosting. The proposed methodology is also illustrated through the Early Dieting in Girls study, which examines the influence of mothers’ overall weight concern on daughters’ dieting behavior.
topic boosting
distance correlation
dose–response function
generalized propensity scores
high dimensional
url https://doi.org/10.1515/jci-2014-0022
work_keys_str_mv AT zhuyeying aboostingalgorithmforestimatinggeneralizedpropensityscoreswithcontinuoustreatments
AT coffmandonnal aboostingalgorithmforestimatinggeneralizedpropensityscoreswithcontinuoustreatments
AT ghoshdebashis aboostingalgorithmforestimatinggeneralizedpropensityscoreswithcontinuoustreatments
AT zhuyeying boostingalgorithmforestimatinggeneralizedpropensityscoreswithcontinuoustreatments
AT coffmandonnal boostingalgorithmforestimatinggeneralizedpropensityscoreswithcontinuoustreatments
AT ghoshdebashis boostingalgorithmforestimatinggeneralizedpropensityscoreswithcontinuoustreatments
_version_ 1717768445874929664