Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area Model
We extend the twofold small-area model of Stukel and Rao (1997; 1999) to accommodate binary data. An example is the Third International Mathematics and Science Study (TIMSS), in which pass-fail data for mathematics of students from US schools (clusters) are available at the third grade by regions an...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Sciendo
2016-03-01
|
Series: | Journal of Official Statistics |
Subjects: | |
Online Access: | https://doi.org/10.1515/jos-2016-0009 |
id |
doaj-10db76ffdbd74f619c0da0eaf3e1b7e6 |
---|---|
record_format |
Article |
spelling |
doaj-10db76ffdbd74f619c0da0eaf3e1b7e62021-09-06T19:40:51ZengSciendoJournal of Official Statistics2001-73672016-03-0132118720810.1515/jos-2016-0009jos-2016-0009Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area ModelNandram Balgobin0Department of Mathematical Sciences, Worcester Polytechnic Institute, 100 Institute Road, Worcester, MA 01609, U.S.A.We extend the twofold small-area model of Stukel and Rao (1997; 1999) to accommodate binary data. An example is the Third International Mathematics and Science Study (TIMSS), in which pass-fail data for mathematics of students from US schools (clusters) are available at the third grade by regions and communities (small areas). We compare the finite population proportions of these small areas. We present a hierarchical Bayesian model in which the firststage binary responses have independent Bernoulli distributions, and each subsequent stage is modeled using a beta distribution, which is parameterized by its mean and a correlation coefficient. This twofold small-area model has an intracluster correlation at the first stage and an intercluster correlation at the second stage. The final-stage mean and all correlations are assumed to be noninformative independent random variables. We show how to infer the finite population proportion of each area. We have applied our models to synthetic TIMSS data to show that the twofold model is preferred over a onefold small-area model that ignores the clustering within areas. We further compare these models using a simulation study, which shows that the intracluster correlation is particularly important.https://doi.org/10.1515/jos-2016-0009intracluster and intercluster correlationscredible intervalsgoodness of fithierarchical modelsimulation study |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Nandram Balgobin |
spellingShingle |
Nandram Balgobin Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area Model Journal of Official Statistics intracluster and intercluster correlations credible intervals goodness of fit hierarchical model simulation study |
author_facet |
Nandram Balgobin |
author_sort |
Nandram Balgobin |
title |
Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area Model |
title_short |
Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area Model |
title_full |
Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area Model |
title_fullStr |
Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area Model |
title_full_unstemmed |
Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area Model |
title_sort |
bayesian predictive inference of a proportion under a twofold small-area model |
publisher |
Sciendo |
series |
Journal of Official Statistics |
issn |
2001-7367 |
publishDate |
2016-03-01 |
description |
We extend the twofold small-area model of Stukel and Rao (1997; 1999) to accommodate binary data. An example is the Third International Mathematics and Science Study (TIMSS), in which pass-fail data for mathematics of students from US schools (clusters) are available at the third grade by regions and communities (small areas). We compare the finite population proportions of these small areas. We present a hierarchical Bayesian model in which the firststage binary responses have independent Bernoulli distributions, and each subsequent stage is modeled using a beta distribution, which is parameterized by its mean and a correlation coefficient. This twofold small-area model has an intracluster correlation at the first stage and an intercluster correlation at the second stage. The final-stage mean and all correlations are assumed to be noninformative independent random variables. We show how to infer the finite population proportion of each area. We have applied our models to synthetic TIMSS data to show that the twofold model is preferred over a onefold small-area model that ignores the clustering within areas. We further compare these models using a simulation study, which shows that the intracluster correlation is particularly important. |
topic |
intracluster and intercluster correlations credible intervals goodness of fit hierarchical model simulation study |
url |
https://doi.org/10.1515/jos-2016-0009 |
work_keys_str_mv |
AT nandrambalgobin bayesianpredictiveinferenceofaproportionunderatwofoldsmallareamodel |
_version_ |
1717767629600456704 |