Analytic assessment of multiple-choice tests

Background: Multiple-choice tests (MCTs) are widely known and applied as useful evaluation tests in education, especially in medical science. Items on a multiple-choice test consist of a stem, which is followed by a correct answer as well as three to four distracters. Items on a well...

Bibliographic Details
Main Authors:	Maryam sadat kaveh tabatabaee, Mohammad Hossein Bahreyni Toosi, Akbar Derakhshan, Mohammad Khajeh Dalloee, Hassan Gholami
Format:	Article
Language:	English
Published:	Shaheed Beheshti University of Medical Sciences and Health Services 2003-01-01
Series:	Journal of Medical Education
Subjects:	multiple choice question; test analysis; reliability; item difficulty Discrimination index
Online Access:	http://journals.sbmu.ac.ir/jme/article/view/883

id	doaj-dbcbdef7d4b7438a92a18717136dd6d6
record_format	Article
spelling	doaj-dbcbdef7d4b7438a92a18717136dd6d6; 2020-11-24T23:00:53Z; eng; Shaheed Beheshti University of Medical Sciences and Health Services; Journal of Medical Education; 1735-3998; 1735-4005; 2003-01-01; 228791; Analytic assessment of multiple-choice tests; Maryam sadat kaveh tabatabaee (0); Mohammad Hossein Bahreyni Toosi (1); Akbar Derakhshan (2); Mohammad Khajeh Dalloee (3); Hassan Gholami (4); (0) faculty member of nursery faculty of Mashad University of Medical Science; (1) assistant professor of Mashad University of Medical Science; (2) associate professor of Mashad University of Medical Science, Director of mashad educational development center; (3) assistant professor of Mashad University of Medical Science; (4) faculty member of nursery faculty of Mashad University of Medical Science; Background: Multiple-choice tests (MCTs) are widely known and applied as useful evaluation tests in education, especially in medical science. Items on a multiple-choice test consist of a stem, which is followed by a correct answer as well as three to four distracters. Items on a well-written multiple-choice test will have stems that are precise and clear, one answer that is clearly correct or best, and distracters that are plausible. Purpose: The purpose of the present study is to conduct item and test analysis of 24 MCTs given in the first semester of the 2000-2001 educational year in the medical faculty of Mashad University of Medical Science. Methods: The data of this descriptive study comprised 1496 MCQs gathered from 2092 answer sheets of 24 MCTs obtained from the educational department of the medical faculty. A split-half method was employed to calculate the reliability coefficient of the MCTs. Item difficulty and discrimination indices were also calculated for the questions. Further studies should be undertaken to develop methods for evaluating validity, assessing distracters, and examining structural principles in MCTs. Results: The mean reliability coefficient of the exams was 0.72±0.13, and in more than 50% of cases the reliability coefficient was greater than 0.7. There was a significant difference in reliability coefficient between basic science exams and clinical clerkship exams (P=0.001). The mean standard error of measurement (SEM) was 3.51±1.11. In 52.2% of cases, the difficulty of MCQs was inappropriate, and 49.3% of questions had inadequate discriminative power to distinguish between poor and good students. Conclusion: Our findings indicate that only 33% of the studied MCQs have both desirable or acceptable item difficulty and discrimination indices, and 34.9% have neither a desirable or acceptable item difficulty nor an acceptable discrimination index. Having subjects respond reliably on a measure is a great start, but there is another concept that one needs to get down really well, named validity. Keywords: multiple choice question, test analysis, reliability, item difficulty Discrimination index; http://journals.sbmu.ac.ir/jme/article/view/883; multiple choice question; test analysis; reliability; item difficulty Discrimination index
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Maryam sadat kaveh tabatabaee; Mohammad Hossein Bahreyni Toosi; Akbar Derakhshan; Mohammad Khajeh Dalloee; Hassan Gholami
spellingShingle	Maryam sadat kaveh tabatabaee Mohammad Hossein Bahreyni Toosi Akbar Derakhshan Mohammad Khajeh Dalloee Hassan Gholami Analytic assessment of multiple-choice tests Journal of Medical Education multiple choice question test analysis reliability item difficulty Discrimination index
author_facet	Maryam sadat kaveh tabatabaee; Mohammad Hossein Bahreyni Toosi; Akbar Derakhshan; Mohammad Khajeh Dalloee; Hassan Gholami
author_sort	Maryam sadat kaveh tabatabaee
title	Analytic assessment of multiple-choice tests
title_short	Analytic assessment of multiple-choice tests
title_full	Analytic assessment of multiple-choice tests
title_fullStr	Analytic assessment of multiple-choice tests
title_full_unstemmed	Analytic assessment of multiple-choice tests
title_sort	analytic assessment of multiple-choice tests
publisher	Shaheed Beheshti University of Medical Sciences and Health Services
series	Journal of Medical Education
issn	1735-3998 1735-4005
publishDate	2003-01-01
description	Background: Multiple-choice tests (MCTs) are widely known and applied as useful evaluation tests in education, especially in medical science. Items on a multiple-choice test consist of a stem, which is followed by a correct answer as well as three to four distracters. Items on a well-written multiple-choice test will have stems that are precise and clear, one answer that is clearly correct or best, and distracters that are plausible. Purpose: The purpose of the present study is to conduct item and test analysis of 24 MCTs given in the first semester of the 2000-2001 educational year in the medical faculty of Mashad University of Medical Science. Methods: The data of this descriptive study comprised 1496 MCQs gathered from 2092 answer sheets of 24 MCTs obtained from the educational department of the medical faculty. A split-half method was employed to calculate the reliability coefficient of the MCTs. Item difficulty and discrimination indices were also calculated for the questions. Further studies should be undertaken to develop methods for evaluating validity, assessing distracters, and examining structural principles in MCTs. Results: The mean reliability coefficient of the exams was 0.72±0.13, and in more than 50% of cases the reliability coefficient was greater than 0.7. There was a significant difference in reliability coefficient between basic science exams and clinical clerkship exams (P=0.001). The mean standard error of measurement (SEM) was 3.51±1.11. In 52.2% of cases, the difficulty of MCQs was inappropriate, and 49.3% of questions had inadequate discriminative power to distinguish between poor and good students. Conclusion: Our findings indicate that only 33% of the studied MCQs have both desirable or acceptable item difficulty and discrimination indices, and 34.9% have neither a desirable or acceptable item difficulty nor an acceptable discrimination index. Having subjects respond reliably on a measure is a great start, but there is another concept that one needs to get down really well, named validity. Keywords: multiple choice question, test analysis, reliability, item difficulty Discrimination index
topic	multiple choice question; test analysis; reliability; item difficulty Discrimination index
url	http://journals.sbmu.ac.ir/jme/article/view/883
work_keys_str_mv	AT maryamsadatkavehtabatabaee analyticassessmentofmultiplechoicetests AT mohammadhosseinbahreynitoosi analyticassessmentofmultiplechoicetests AT akbarderakhshan analyticassessmentofmultiplechoicetests AT mohammadkhajehdalloee analyticassessmentofmultiplechoicetests AT hassangholami analyticassessmentofmultiplechoicetests
_version_	1725640980848181248
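
The quantities named in the description (split-half reliability, standard error of measurement, item difficulty, discrimination index) can be illustrated with a minimal Python sketch. The record does not say how the authors split the tests or grouped examinees, so the odd/even item split with the Spearman-Brown correction, the population standard deviation in the SEM, and the 27% upper and lower groups are assumptions based on common textbook conventions. All function names are hypothetical, and the sample data are made up for illustration.

```python
from statistics import mean, pstdev


def pearson(x, y):
    """Pearson correlation of two equal-length score lists.

    Raises ZeroDivisionError if either list has zero variance.
    """
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5


def split_half_reliability(responses):
    """Split-half reliability of a test.

    responses: one row per examinee, each row a list of 0/1 item scores.
    Assumption: odd/even item split, stepped up with Spearman-Brown.
    """
    first_half = [sum(row[0::2]) for row in responses]   # items 1, 3, 5, ...
    second_half = [sum(row[1::2]) for row in responses]  # items 2, 4, 6, ...
    r_half = pearson(first_half, second_half)
    return 2 * r_half / (1 + r_half)


def standard_error_of_measurement(responses, reliability):
    """SEM = SD of total scores * sqrt(1 - reliability).

    Assumption: population standard deviation of the total scores.
    """
    totals = [sum(row) for row in responses]
    return pstdev(totals) * (1 - reliability) ** 0.5


def item_difficulty(responses, item):
    """Proportion of examinees who answered the item correctly."""
    return mean(row[item] for row in responses)


def discrimination_index(responses, item, fraction=0.27):
    """Difference in proportion correct between top and bottom scorers.

    Assumption: groups are the top and bottom `fraction` of examinees
    ranked by total score (27% is a common convention).
    """
    ranked = sorted(responses, key=sum, reverse=True)
    k = max(1, round(len(ranked) * fraction))
    upper, lower = ranked[:k], ranked[-k:]
    return mean(r[item] for r in upper) - mean(r[item] for r in lower)


if __name__ == "__main__":
    # Made-up answer sheets: 5 examinees, 4 items, 1 = correct.
    sheets = [
        [1, 1, 1, 1],
        [1, 1, 1, 0],
        [1, 1, 0, 0],
        [1, 0, 0, 0],
        [0, 0, 0, 0],
    ]
    rel = split_half_reliability(sheets)
    print("reliability:", rel)
    print("SEM:", standard_error_of_measurement(sheets, rel))
    print("difficulty of item 1:", item_difficulty(sheets, 0))
    print("discrimination of item 1:", discrimination_index(sheets, 0, 0.4))
```

Thresholds for "acceptable" difficulty or discrimination are not given in the record, so the sketch only computes the indices and leaves their classification to the reader.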

Similar Items