CONSTRAINED STATISTICAL INFERENCE WHEN TARGET AND SAMPLE POPULATIONS DIFFER

When analyzing an I × J contingency table, there are situations where sampling is taken from a sampled population that differs from the target population. Clearly the resulting estimators are typically biased. In this dissertation, four adjusting methods for estimating the cell probabilities under i...

Full description

Bibliographic Details
Main Author: Yi, Huijun
Format: Others
Published: OpenSIUC 2014
Online Access:https://opensiuc.lib.siu.edu/dissertations/870
https://opensiuc.lib.siu.edu/cgi/viewcontent.cgi?article=1873&context=dissertations
Description
Summary:When analyzing an I × J contingency table, there are situations where sampling is taken from a sampled population that differs from the target population. Clearly the resulting estimators are typically biased. In this dissertation, four adjusting methods for estimating the cell probabilities under inequality constrains, namely, raking (RAKE), maximum likelihood under random sampling (MLRS), minimum chi-squared (MCSQ), and least squares (LSQ) are developed for particular models relating the target and sampled populations. Considering the difficulty of solving primal problem due to large dimensions, we use the Khun-Tucker conditions to exploit the duality for each method. Extensive simulation is performed to provide a systematic comparison between adjusting methods. The comparisons are also made by using a measure of information loss because of biased sampling. We apply four methods to the second National Health and Nutrition Examination Survey data under reasonable constraints. Not only the performance of four methods are compared in the example, but also how different constraints affect the quality of estimation is inspected. We emphasize that a sampling is taken from a sample population that differs from a target population. In the absence of knowledge of target population, a prior distribution can be considered as describing the initial knowledge about the target population. In the dissertation we also discuss Bayesian analysis with the use of a proper prior to measure missing information.