The Performance of the MIRT Plausible Values Method under Vertical Scaling Design

碩士 === 國立臺中教育大學 === 教育測驗統計研究所 === 101 ===   Large-scale assessment is one of the important approaches that the government used to evaluate the effect of policy of education. The purpose is to monitor population progress. Therefore, population statistics are what the large-scale assessment focus on....

Full description

Bibliographic Details
Main Authors: Chen, Wan-Ning, 陳婉寧
Other Authors: Kuo, Bor-Chen
Format: Others
Language:zh-TW
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/01909434611955187801
id ndltd-TW-101NTCT0629030
record_format oai_dc
spelling ndltd-TW-101NTCT06290302017-04-09T04:33:28Z http://ndltd.ncl.edu.tw/handle/01909434611955187801 The Performance of the MIRT Plausible Values Method under Vertical Scaling Design 以可能值方法為基礎之多向度垂直等化之探究 Chen, Wan-Ning 陳婉寧 碩士 國立臺中教育大學 教育測驗統計研究所 101   Large-scale assessment is one of the important approaches that the government used to evaluate the effect of policy of education. The purpose is to monitor population progress. Therefore, population statistics are what the large-scale assessment focus on. Plausible value method is proposed to be a great method that measures population statistics accurately so it is used to provide students’ achievement data by some significant large-scale assessment programs, nowadays. As the NCLB legislation calls, the issues of vertical scaling gets more concerned. Vertical scaling is the way test publishers used to longitudinally evaluate achievement that spans grade levels. For the test structure getting more complicate, researchers have agreed that multidimensional traits come into plays in test. Also, when calibrating items from multiple test forms for the purpose of measuring students across a range of grade levels, the IRT assumption of unidimensionality would appear to be impossible.   For the foregoing points, this research is aimed to analysis if: (1) the number of dimensions of the test, (2) the number of item for each dimension, and (3) the method that used to estimate parameters whether or not impact on the recovery of ability parameters of subjects and the recovery of population statistics, based on multidimensional IRT with the vertical scaling design. The result indicates that plausible value method recovers the standard deviation very well but not outstands in recovering the population means. When using MIRT vertical design, parameters are estimated better when the number of items is more. Kuo, Bor-Chen 郭伯臣 2013 學位論文 ; thesis 65 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺中教育大學 === 教育測驗統計研究所 === 101 ===   Large-scale assessment is one of the important approaches that the government used to evaluate the effect of policy of education. The purpose is to monitor population progress. Therefore, population statistics are what the large-scale assessment focus on. Plausible value method is proposed to be a great method that measures population statistics accurately so it is used to provide students’ achievement data by some significant large-scale assessment programs, nowadays. As the NCLB legislation calls, the issues of vertical scaling gets more concerned. Vertical scaling is the way test publishers used to longitudinally evaluate achievement that spans grade levels. For the test structure getting more complicate, researchers have agreed that multidimensional traits come into plays in test. Also, when calibrating items from multiple test forms for the purpose of measuring students across a range of grade levels, the IRT assumption of unidimensionality would appear to be impossible.   For the foregoing points, this research is aimed to analysis if: (1) the number of dimensions of the test, (2) the number of item for each dimension, and (3) the method that used to estimate parameters whether or not impact on the recovery of ability parameters of subjects and the recovery of population statistics, based on multidimensional IRT with the vertical scaling design. The result indicates that plausible value method recovers the standard deviation very well but not outstands in recovering the population means. When using MIRT vertical design, parameters are estimated better when the number of items is more.
author2 Kuo, Bor-Chen
author_facet Kuo, Bor-Chen
Chen, Wan-Ning
陳婉寧
author Chen, Wan-Ning
陳婉寧
spellingShingle Chen, Wan-Ning
陳婉寧
The Performance of the MIRT Plausible Values Method under Vertical Scaling Design
author_sort Chen, Wan-Ning
title The Performance of the MIRT Plausible Values Method under Vertical Scaling Design
title_short The Performance of the MIRT Plausible Values Method under Vertical Scaling Design
title_full The Performance of the MIRT Plausible Values Method under Vertical Scaling Design
title_fullStr The Performance of the MIRT Plausible Values Method under Vertical Scaling Design
title_full_unstemmed The Performance of the MIRT Plausible Values Method under Vertical Scaling Design
title_sort performance of the mirt plausible values method under vertical scaling design
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/01909434611955187801
work_keys_str_mv AT chenwanning theperformanceofthemirtplausiblevaluesmethodunderverticalscalingdesign
AT chénwǎnníng theperformanceofthemirtplausiblevaluesmethodunderverticalscalingdesign
AT chenwanning yǐkěnéngzhífāngfǎwèijīchǔzhīduōxiàngdùchuízhíděnghuàzhītànjiū
AT chénwǎnníng yǐkěnéngzhífāngfǎwèijīchǔzhīduōxiàngdùchuízhíděnghuàzhītànjiū
AT chenwanning performanceofthemirtplausiblevaluesmethodunderverticalscalingdesign
AT chénwǎnníng performanceofthemirtplausiblevaluesmethodunderverticalscalingdesign
_version_ 1718437521845649408