Evaluating model selection criteria for nuisance models in causal inference : (when the true models are finite mixtures)

When using inverse probability weighting (IPW) and doubly robust (DR) estimators for estimating the causal effect we need to use nuisance models for estimating the propensity scores, and for the DR estimator also the outcome regression. These nuisance models are often created using a selection of co...

Full description

Bibliographic Details
Main Author: Hellman, Samuel
Format: Others
Language:English
Published: Umeå universitet, Statistik 2016
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-122686
id ndltd-UPSALLA1-oai-DiVA.org-umu-122686
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-umu-1226862016-06-22T05:21:38ZEvaluating model selection criteria for nuisance models in causal inference : (when the true models are finite mixtures)engUtvärdering av modellselektionskriterier för skattning av störningsparametrar inom kausal inferens : (När de sanna modellerna består av en blandning av ett ändligt antal fördelningar)Hellman, SamuelUmeå universitet, Statistik2016When using inverse probability weighting (IPW) and doubly robust (DR) estimators for estimating the causal effect we need to use nuisance models for estimating the propensity scores, and for the DR estimator also the outcome regression. These nuisance models are often created using a selection of covariates, higher order and interaction terms, which require some model selection criteria. In this paper we will study selection criteria and their performance in fitting nuisance models for semi-parametric estimation of the causal eect. This will be done as a simulation study. Apart from the usual simulation design using known parametric models for the true values of the outcomes and propensity scores we will also use finite mixture model simulation. From that we can evaluate how changing the design approach changes the characteristics of the nuisance models corresponding to the different criteria, and if we can still find the causal effect using them. In all simulations we found that the Bayesian Information Criteria generally creates the nuisance models with the least amounts of estimated parameters, especially for estimating the propensity scores where it rarely uses more than one covariate. Due to this the IPW estimations of the causal effect was generally more biased for that criteria than those made using other criteria. When using DR estimators the robustness of the estimator causes all criteria to create equally unbiased estimations with low variance. Using LASSO for model selection was generally the best as it created estimations with the lowest bias and variance. When using finite mixture models in the simulation design the differences between the estimations corresponding to the different criteria disappear. When we only use mixtures for the propensity scores we can still find the true causal effect, even for the IPW estimators that only use the propensity scores as a nuisance parameter. When applying finite mixture models to both the propensity scores and the outcome regressions we overestimate the causal effect for all criteria, and we see little to no difference between the estimators made using the dierent criteria. In that case we also could not find the true causal effect using a non-parametric matching estimator. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-122686application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
description When using inverse probability weighting (IPW) and doubly robust (DR) estimators for estimating the causal effect we need to use nuisance models for estimating the propensity scores, and for the DR estimator also the outcome regression. These nuisance models are often created using a selection of covariates, higher order and interaction terms, which require some model selection criteria. In this paper we will study selection criteria and their performance in fitting nuisance models for semi-parametric estimation of the causal eect. This will be done as a simulation study. Apart from the usual simulation design using known parametric models for the true values of the outcomes and propensity scores we will also use finite mixture model simulation. From that we can evaluate how changing the design approach changes the characteristics of the nuisance models corresponding to the different criteria, and if we can still find the causal effect using them. In all simulations we found that the Bayesian Information Criteria generally creates the nuisance models with the least amounts of estimated parameters, especially for estimating the propensity scores where it rarely uses more than one covariate. Due to this the IPW estimations of the causal effect was generally more biased for that criteria than those made using other criteria. When using DR estimators the robustness of the estimator causes all criteria to create equally unbiased estimations with low variance. Using LASSO for model selection was generally the best as it created estimations with the lowest bias and variance. When using finite mixture models in the simulation design the differences between the estimations corresponding to the different criteria disappear. When we only use mixtures for the propensity scores we can still find the true causal effect, even for the IPW estimators that only use the propensity scores as a nuisance parameter. When applying finite mixture models to both the propensity scores and the outcome regressions we overestimate the causal effect for all criteria, and we see little to no difference between the estimators made using the dierent criteria. In that case we also could not find the true causal effect using a non-parametric matching estimator.
author Hellman, Samuel
spellingShingle Hellman, Samuel
Evaluating model selection criteria for nuisance models in causal inference : (when the true models are finite mixtures)
author_facet Hellman, Samuel
author_sort Hellman, Samuel
title Evaluating model selection criteria for nuisance models in causal inference : (when the true models are finite mixtures)
title_short Evaluating model selection criteria for nuisance models in causal inference : (when the true models are finite mixtures)
title_full Evaluating model selection criteria for nuisance models in causal inference : (when the true models are finite mixtures)
title_fullStr Evaluating model selection criteria for nuisance models in causal inference : (when the true models are finite mixtures)
title_full_unstemmed Evaluating model selection criteria for nuisance models in causal inference : (when the true models are finite mixtures)
title_sort evaluating model selection criteria for nuisance models in causal inference : (when the true models are finite mixtures)
publisher Umeå universitet, Statistik
publishDate 2016
url http://urn.kb.se/resolve?urn=urn:nbn:se:umu:diva-122686
work_keys_str_mv AT hellmansamuel evaluatingmodelselectioncriteriafornuisancemodelsincausalinferencewhenthetruemodelsarefinitemixtures
AT hellmansamuel utvarderingavmodellselektionskriterierforskattningavstorningsparametrarinomkausalinferensnardesannamodellernabestaravenblandningavettandligtantalfordelningar
_version_ 1718320668811984896