Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area Model

We extend the twofold small-area model of Stukel and Rao (1997; 1999) to accommodate binary data. An example is the Third International Mathematics and Science Study (TIMSS), in which pass-fail data for mathematics of students from US schools (clusters) are available at the third grade by regions an...

Full description

Bibliographic Details
Main Author: Nandram Balgobin
Format: Article
Language:English
Published: Sciendo 2016-03-01
Series:Journal of Official Statistics
Subjects:
Online Access:https://doi.org/10.1515/jos-2016-0009
id doaj-10db76ffdbd74f619c0da0eaf3e1b7e6
record_format Article
spelling doaj-10db76ffdbd74f619c0da0eaf3e1b7e62021-09-06T19:40:51ZengSciendoJournal of Official Statistics2001-73672016-03-0132118720810.1515/jos-2016-0009jos-2016-0009Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area ModelNandram Balgobin0Department of Mathematical Sciences, Worcester Polytechnic Institute, 100 Institute Road, Worcester, MA 01609, U.S.A.We extend the twofold small-area model of Stukel and Rao (1997; 1999) to accommodate binary data. An example is the Third International Mathematics and Science Study (TIMSS), in which pass-fail data for mathematics of students from US schools (clusters) are available at the third grade by regions and communities (small areas). We compare the finite population proportions of these small areas. We present a hierarchical Bayesian model in which the firststage binary responses have independent Bernoulli distributions, and each subsequent stage is modeled using a beta distribution, which is parameterized by its mean and a correlation coefficient. This twofold small-area model has an intracluster correlation at the first stage and an intercluster correlation at the second stage. The final-stage mean and all correlations are assumed to be noninformative independent random variables. We show how to infer the finite population proportion of each area. We have applied our models to synthetic TIMSS data to show that the twofold model is preferred over a onefold small-area model that ignores the clustering within areas. We further compare these models using a simulation study, which shows that the intracluster correlation is particularly important.https://doi.org/10.1515/jos-2016-0009intracluster and intercluster correlationscredible intervalsgoodness of fithierarchical modelsimulation study
collection DOAJ
language English
format Article
sources DOAJ
author Nandram Balgobin
spellingShingle Nandram Balgobin
Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area Model
Journal of Official Statistics
intracluster and intercluster correlations
credible intervals
goodness of fit
hierarchical model
simulation study
author_facet Nandram Balgobin
author_sort Nandram Balgobin
title Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area Model
title_short Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area Model
title_full Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area Model
title_fullStr Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area Model
title_full_unstemmed Bayesian Predictive Inference of a Proportion Under a Twofold Small-Area Model
title_sort bayesian predictive inference of a proportion under a twofold small-area model
publisher Sciendo
series Journal of Official Statistics
issn 2001-7367
publishDate 2016-03-01
description We extend the twofold small-area model of Stukel and Rao (1997; 1999) to accommodate binary data. An example is the Third International Mathematics and Science Study (TIMSS), in which pass-fail data for mathematics of students from US schools (clusters) are available at the third grade by regions and communities (small areas). We compare the finite population proportions of these small areas. We present a hierarchical Bayesian model in which the firststage binary responses have independent Bernoulli distributions, and each subsequent stage is modeled using a beta distribution, which is parameterized by its mean and a correlation coefficient. This twofold small-area model has an intracluster correlation at the first stage and an intercluster correlation at the second stage. The final-stage mean and all correlations are assumed to be noninformative independent random variables. We show how to infer the finite population proportion of each area. We have applied our models to synthetic TIMSS data to show that the twofold model is preferred over a onefold small-area model that ignores the clustering within areas. We further compare these models using a simulation study, which shows that the intracluster correlation is particularly important.
topic intracluster and intercluster correlations
credible intervals
goodness of fit
hierarchical model
simulation study
url https://doi.org/10.1515/jos-2016-0009
work_keys_str_mv AT nandrambalgobin bayesianpredictiveinferenceofaproportionunderatwofoldsmallareamodel
_version_ 1717767629600456704