Impacts of Ignoring Nested Data Structure in Rasch/IRT Model and Comparison of Different Estimation Methods

This study involves investigating the impacts of ignoring nested data structure in Rasch/1PL item response theory (IRT) model via a two-level and three-level hierarchical generalized linear model (HGLM). Currently, Rasch/IRT models are frequently used in educational and psychometric researches for d...

Full description

Bibliographic Details
Main Author: Chungbaek, Youngyun
Other Authors: Educational Leadership and Policy Studies
Format: Others
Language:en_US
Published: Virginia Tech 2017
Subjects:
AGQ
PQL
IRT
Online Access:http://hdl.handle.net/10919/77086
http://scholar.lib.vt.edu/theses/available/etd-05162011-110459/
id ndltd-VTETD-oai-vtechworks.lib.vt.edu-10919-77086
record_format oai_dc
spelling ndltd-VTETD-oai-vtechworks.lib.vt.edu-10919-770862020-09-29T05:30:22Z Impacts of Ignoring Nested Data Structure in Rasch/IRT Model and Comparison of Different Estimation Methods Chungbaek, Youngyun Educational Leadership and Policy Studies Miyazaki, Yasuo Hein, Serge F. Chang, Mido Skaggs, Gary E. theoretical standard error. RMSE MCSE bias AGQ Laplace PQL IRT Rasch HGLM nested data structure Monte Carlo simulation This study involves investigating the impacts of ignoring nested data structure in Rasch/1PL item response theory (IRT) model via a two-level and three-level hierarchical generalized linear model (HGLM). Currently, Rasch/IRT models are frequently used in educational and psychometric researches for data obtained from multistage cluster samplings, which are more likely to violate the assumption of independent observations of examinees required by Rasch/IRT models. The violation of the assumption of independent observation, however, is ignored in the current standard practices which apply the standard Rasch/IRT for the large scale testing data. A simulation study (Study Two) was conducted to address this issue of the effects of ignoring nested data structure in Rasch/IRT models under various conditions, following a simulation study (Study One) to compare the performances of three methods, such as Penalized Quasi-Likelihood (PQL), Laplace approximation, and Adaptive Gaussian Quadrature (AGQ), commonly used in HGLM in terms of accuracy and efficiency in estimating parameters. As expected, PQL tended to produce seriously biased item difficulty estimates and ability variance estimates whereas almost unbiased for Laplace or AGQ for both 2-level and 3-level analysis. As for the root mean squared errors (RMSE), three methods performed without substantive differences for item difficulty estimates and ability variance estimates in both 2-level and 3-level analysis, except for level-2 ability variance estimates in 3-level analysis. Generally, Laplace and AGQ performed similarly well in terms of bias and RMSE of parameter estimates; however, Laplace exhibited a much lower convergence rate than that of AGQ in 3-level analyses. The results from AGQ, which produced the most accurate and stable results among three computational methods, demonstrated that the theoretical standard errors (SE), i.e., asymptotic information-based SEs, were underestimated by at most 34% when 2-level analyses were used for the data generated from 3-level model, implying that the Type I error rate would be inflated when the nested data structures are ignored in Rasch/IRT models. The underestimated theoretical standard errors were substantively more severe as the true ability variance increased or the number of students within schools increased regardless of test length or the number of schools. Ph. D. 2017-04-06T15:42:34Z 2017-04-06T15:42:34Z 2011-05-02 2011-05-16 2016-09-27 2011-06-06 Dissertation Text etd-05162011-110459 http://hdl.handle.net/10919/77086 http://scholar.lib.vt.edu/theses/available/etd-05162011-110459/ en_US In Copyright http://rightsstatements.org/vocab/InC/1.0/ application/pdf Virginia Tech
collection NDLTD
language en_US
format Others
sources NDLTD
topic theoretical standard error.
RMSE
MCSE
bias
AGQ
Laplace
PQL
IRT
Rasch
HGLM
nested data structure
Monte Carlo simulation
spellingShingle theoretical standard error.
RMSE
MCSE
bias
AGQ
Laplace
PQL
IRT
Rasch
HGLM
nested data structure
Monte Carlo simulation
Chungbaek, Youngyun
Impacts of Ignoring Nested Data Structure in Rasch/IRT Model and Comparison of Different Estimation Methods
description This study involves investigating the impacts of ignoring nested data structure in Rasch/1PL item response theory (IRT) model via a two-level and three-level hierarchical generalized linear model (HGLM). Currently, Rasch/IRT models are frequently used in educational and psychometric researches for data obtained from multistage cluster samplings, which are more likely to violate the assumption of independent observations of examinees required by Rasch/IRT models. The violation of the assumption of independent observation, however, is ignored in the current standard practices which apply the standard Rasch/IRT for the large scale testing data. A simulation study (Study Two) was conducted to address this issue of the effects of ignoring nested data structure in Rasch/IRT models under various conditions, following a simulation study (Study One) to compare the performances of three methods, such as Penalized Quasi-Likelihood (PQL), Laplace approximation, and Adaptive Gaussian Quadrature (AGQ), commonly used in HGLM in terms of accuracy and efficiency in estimating parameters. As expected, PQL tended to produce seriously biased item difficulty estimates and ability variance estimates whereas almost unbiased for Laplace or AGQ for both 2-level and 3-level analysis. As for the root mean squared errors (RMSE), three methods performed without substantive differences for item difficulty estimates and ability variance estimates in both 2-level and 3-level analysis, except for level-2 ability variance estimates in 3-level analysis. Generally, Laplace and AGQ performed similarly well in terms of bias and RMSE of parameter estimates; however, Laplace exhibited a much lower convergence rate than that of AGQ in 3-level analyses. The results from AGQ, which produced the most accurate and stable results among three computational methods, demonstrated that the theoretical standard errors (SE), i.e., asymptotic information-based SEs, were underestimated by at most 34% when 2-level analyses were used for the data generated from 3-level model, implying that the Type I error rate would be inflated when the nested data structures are ignored in Rasch/IRT models. The underestimated theoretical standard errors were substantively more severe as the true ability variance increased or the number of students within schools increased regardless of test length or the number of schools. === Ph. D.
author2 Educational Leadership and Policy Studies
author_facet Educational Leadership and Policy Studies
Chungbaek, Youngyun
author Chungbaek, Youngyun
author_sort Chungbaek, Youngyun
title Impacts of Ignoring Nested Data Structure in Rasch/IRT Model and Comparison of Different Estimation Methods
title_short Impacts of Ignoring Nested Data Structure in Rasch/IRT Model and Comparison of Different Estimation Methods
title_full Impacts of Ignoring Nested Data Structure in Rasch/IRT Model and Comparison of Different Estimation Methods
title_fullStr Impacts of Ignoring Nested Data Structure in Rasch/IRT Model and Comparison of Different Estimation Methods
title_full_unstemmed Impacts of Ignoring Nested Data Structure in Rasch/IRT Model and Comparison of Different Estimation Methods
title_sort impacts of ignoring nested data structure in rasch/irt model and comparison of different estimation methods
publisher Virginia Tech
publishDate 2017
url http://hdl.handle.net/10919/77086
http://scholar.lib.vt.edu/theses/available/etd-05162011-110459/
work_keys_str_mv AT chungbaekyoungyun impactsofignoringnesteddatastructureinraschirtmodelandcomparisonofdifferentestimationmethods
_version_ 1719343177443311616