The Performance of the MIRT Plausible Values Method under Vertical Scaling Design

碩士 === 國立臺中教育大學 === 教育測驗統計研究所 === 101 === 　　Large-scale assessment is one of the important approaches that the government used to evaluate the effect of policy of education. The purpose is to monitor population progress. Therefore, population statistics are what the large-scale assessment focus on....

Full description

Bibliographic Details
Main Authors:	Chen, Wan-Ning, 陳婉寧
Other Authors:	Kuo, Bor-Chen
Format:	Others
Language:	zh-TW
Published:	2013
Online Access:	http://ndltd.ncl.edu.tw/handle/01909434611955187801

id	ndltd-TW-101NTCT0629030
record_format	oai_dc
spelling	ndltd-TW-101NTCT06290302017-04-09T04:33:28Z http://ndltd.ncl.edu.tw/handle/01909434611955187801 The Performance of the MIRT Plausible Values Method under Vertical Scaling Design 以可能值方法為基礎之多向度垂直等化之探究 Chen, Wan-Ning 陳婉寧碩士國立臺中教育大學教育測驗統計研究所 101 　　Large-scale assessment is one of the important approaches that the government used to evaluate the effect of policy of education. The purpose is to monitor population progress. Therefore, population statistics are what the large-scale assessment focus on. Plausible value method is proposed to be a great method that measures population statistics accurately so it is used to provide students’ achievement data by some significant large-scale assessment programs, nowadays. As the NCLB legislation calls, the issues of vertical scaling gets more concerned. Vertical scaling is the way test publishers used to longitudinally evaluate achievement that spans grade levels. For the test structure getting more complicate, researchers have agreed that multidimensional traits come into plays in test. Also, when calibrating items from multiple test forms for the purpose of measuring students across a range of grade levels, the IRT assumption of unidimensionality would appear to be impossible. 　　For the foregoing points, this research is aimed to analysis if: (1) the number of dimensions of the test, (2) the number of item for each dimension, and (3) the method that used to estimate parameters whether or not impact on the recovery of ability parameters of subjects and the recovery of population statistics, based on multidimensional IRT with the vertical scaling design. The result indicates that plausible value method recovers the standard deviation very well but not outstands in recovering the population means. When using MIRT vertical design, parameters are estimated better when the number of items is more. Kuo, Bor-Chen 郭伯臣 2013 學位論文 ; thesis 65 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立臺中教育大學 === 教育測驗統計研究所 === 101 === 　　Large-scale assessment is one of the important approaches that the government used to evaluate the effect of policy of education. The purpose is to monitor population progress. Therefore, population statistics are what the large-scale assessment focus on. Plausible value method is proposed to be a great method that measures population statistics accurately so it is used to provide students’ achievement data by some significant large-scale assessment programs, nowadays. As the NCLB legislation calls, the issues of vertical scaling gets more concerned. Vertical scaling is the way test publishers used to longitudinally evaluate achievement that spans grade levels. For the test structure getting more complicate, researchers have agreed that multidimensional traits come into plays in test. Also, when calibrating items from multiple test forms for the purpose of measuring students across a range of grade levels, the IRT assumption of unidimensionality would appear to be impossible. 　　For the foregoing points, this research is aimed to analysis if: (1) the number of dimensions of the test, (2) the number of item for each dimension, and (3) the method that used to estimate parameters whether or not impact on the recovery of ability parameters of subjects and the recovery of population statistics, based on multidimensional IRT with the vertical scaling design. The result indicates that plausible value method recovers the standard deviation very well but not outstands in recovering the population means. When using MIRT vertical design, parameters are estimated better when the number of items is more.
author2	Kuo, Bor-Chen
author_facet	Kuo, Bor-Chen Chen, Wan-Ning 陳婉寧
author	Chen, Wan-Ning 陳婉寧
spellingShingle	Chen, Wan-Ning 陳婉寧 The Performance of the MIRT Plausible Values Method under Vertical Scaling Design
author_sort	Chen, Wan-Ning
title	The Performance of the MIRT Plausible Values Method under Vertical Scaling Design
title_short	The Performance of the MIRT Plausible Values Method under Vertical Scaling Design
title_full	The Performance of the MIRT Plausible Values Method under Vertical Scaling Design
title_fullStr	The Performance of the MIRT Plausible Values Method under Vertical Scaling Design
title_full_unstemmed	The Performance of the MIRT Plausible Values Method under Vertical Scaling Design
title_sort	performance of the mirt plausible values method under vertical scaling design
publishDate	2013
url	http://ndltd.ncl.edu.tw/handle/01909434611955187801
work_keys_str_mv	AT chenwanning theperformanceofthemirtplausiblevaluesmethodunderverticalscalingdesign AT chénwǎnníng theperformanceofthemirtplausiblevaluesmethodunderverticalscalingdesign AT chenwanning yǐkěnéngzhífāngfǎwèijīchǔzhīduōxiàngdùchuízhíděnghuàzhītànjiū AT chénwǎnníng yǐkěnéngzhífāngfǎwèijīchǔzhīduōxiàngdùchuízhíděnghuàzhītànjiū AT chenwanning performanceofthemirtplausiblevaluesmethodunderverticalscalingdesign AT chénwǎnníng performanceofthemirtplausiblevaluesmethodunderverticalscalingdesign
_version_	1718437521845649408

The Performance of the MIRT Plausible Values Method under Vertical Scaling Design

Similar Items