The Performance of the MIRT Plausible Values Method under Vertical Scaling Design
碩士 === 國立臺中教育大學 === 教育測驗統計研究所 === 101 === Large-scale assessment is one of the important approaches that the government used to evaluate the effect of policy of education. The purpose is to monitor population progress. Therefore, population statistics are what the large-scale assessment focus on....
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2013
|
Online Access: | http://ndltd.ncl.edu.tw/handle/01909434611955187801 |
id |
ndltd-TW-101NTCT0629030 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-101NTCT06290302017-04-09T04:33:28Z http://ndltd.ncl.edu.tw/handle/01909434611955187801 The Performance of the MIRT Plausible Values Method under Vertical Scaling Design 以可能值方法為基礎之多向度垂直等化之探究 Chen, Wan-Ning 陳婉寧 碩士 國立臺中教育大學 教育測驗統計研究所 101 Large-scale assessment is one of the important approaches that the government used to evaluate the effect of policy of education. The purpose is to monitor population progress. Therefore, population statistics are what the large-scale assessment focus on. Plausible value method is proposed to be a great method that measures population statistics accurately so it is used to provide students’ achievement data by some significant large-scale assessment programs, nowadays. As the NCLB legislation calls, the issues of vertical scaling gets more concerned. Vertical scaling is the way test publishers used to longitudinally evaluate achievement that spans grade levels. For the test structure getting more complicate, researchers have agreed that multidimensional traits come into plays in test. Also, when calibrating items from multiple test forms for the purpose of measuring students across a range of grade levels, the IRT assumption of unidimensionality would appear to be impossible. For the foregoing points, this research is aimed to analysis if: (1) the number of dimensions of the test, (2) the number of item for each dimension, and (3) the method that used to estimate parameters whether or not impact on the recovery of ability parameters of subjects and the recovery of population statistics, based on multidimensional IRT with the vertical scaling design. The result indicates that plausible value method recovers the standard deviation very well but not outstands in recovering the population means. When using MIRT vertical design, parameters are estimated better when the number of items is more. Kuo, Bor-Chen 郭伯臣 2013 學位論文 ; thesis 65 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺中教育大學 === 教育測驗統計研究所 === 101 === Large-scale assessment is one of the important approaches that the government used to evaluate the effect of policy of education. The purpose is to monitor population progress. Therefore, population statistics are what the large-scale assessment focus on. Plausible value method is proposed to be a great method that measures population statistics accurately so it is used to provide students’ achievement data by some significant large-scale assessment programs, nowadays. As the NCLB legislation calls, the issues of vertical scaling gets more concerned. Vertical scaling is the way test publishers used to longitudinally evaluate achievement that spans grade levels. For the test structure getting more complicate, researchers have agreed that multidimensional traits come into plays in test. Also, when calibrating items from multiple test forms for the purpose of measuring students across a range of grade levels, the IRT assumption of unidimensionality would appear to be impossible.
For the foregoing points, this research is aimed to analysis if: (1) the number of dimensions of the test, (2) the number of item for each dimension, and (3) the method that used to estimate parameters whether or not impact on the recovery of ability parameters of subjects and the recovery of population statistics, based on multidimensional IRT with the vertical scaling design.
The result indicates that plausible value method recovers the standard deviation very well but not outstands in recovering the population means. When using MIRT vertical design, parameters are estimated better when the number of items is more.
|
author2 |
Kuo, Bor-Chen |
author_facet |
Kuo, Bor-Chen Chen, Wan-Ning 陳婉寧 |
author |
Chen, Wan-Ning 陳婉寧 |
spellingShingle |
Chen, Wan-Ning 陳婉寧 The Performance of the MIRT Plausible Values Method under Vertical Scaling Design |
author_sort |
Chen, Wan-Ning |
title |
The Performance of the MIRT Plausible Values Method under Vertical Scaling Design |
title_short |
The Performance of the MIRT Plausible Values Method under Vertical Scaling Design |
title_full |
The Performance of the MIRT Plausible Values Method under Vertical Scaling Design |
title_fullStr |
The Performance of the MIRT Plausible Values Method under Vertical Scaling Design |
title_full_unstemmed |
The Performance of the MIRT Plausible Values Method under Vertical Scaling Design |
title_sort |
performance of the mirt plausible values method under vertical scaling design |
publishDate |
2013 |
url |
http://ndltd.ncl.edu.tw/handle/01909434611955187801 |
work_keys_str_mv |
AT chenwanning theperformanceofthemirtplausiblevaluesmethodunderverticalscalingdesign AT chénwǎnníng theperformanceofthemirtplausiblevaluesmethodunderverticalscalingdesign AT chenwanning yǐkěnéngzhífāngfǎwèijīchǔzhīduōxiàngdùchuízhíděnghuàzhītànjiū AT chénwǎnníng yǐkěnéngzhífāngfǎwèijīchǔzhīduōxiàngdùchuízhíděnghuàzhītànjiū AT chenwanning performanceofthemirtplausiblevaluesmethodunderverticalscalingdesign AT chénwǎnníng performanceofthemirtplausiblevaluesmethodunderverticalscalingdesign |
_version_ |
1718437521845649408 |