Analytic assessment of multiple-choice tests
Background: Multiple choice tests (MCT),are widely known and applied as useful evaluation tests in the field of education especially in Medical Science. Items on a multiple-choice test consist of a stem, which is followed by a correct answer as well as three to four distracters. Items on a well...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Shaheed Beheshti University of Medical Sciences and Health Services
2003-01-01
|
Series: | Journal of Medical Education |
Subjects: | |
Online Access: | http://journals.sbmu.ac.ir/jme/article/view/883 |
id |
doaj-dbcbdef7d4b7438a92a18717136dd6d6 |
---|---|
record_format |
Article |
spelling |
doaj-dbcbdef7d4b7438a92a18717136dd6d62020-11-24T23:00:53ZengShaheed Beheshti University of Medical Sciences and Health ServicesJournal of Medical Education1735-39981735-40052003-01-01228791Analytic assessment of multiple-choice testsMaryam sadat kaveh tabatabaee0Mohammad Hossein Bahreyni Toosi1Akbar Derakhshan 2Mohammad Khajeh Dalloee3Hassan Gholami 4faculty member of nursery faculty of Mashad University of Medical Scienceassistant professor of Mashad University of Medical Scienceassociate professor of Mashad University of Medical Science, Director of mashad educational development centerassistant professor of Mashad University of Medical Sciencefaculty member of nursery faculty of Mashad University of Medical ScienceBackground: Multiple choice tests (MCT),are widely known and applied as useful evaluation tests in the field of education especially in Medical Science. Items on a multiple-choice test consist of a stem, which is followed by a correct answer as well as three to four distracters. Items on a well-written multiple-choice test will have stems that are precise and clear, one answer that is clearly correct or best, and distracters that are plausible. Purpose: The purpose of the present study is conducting item and test analysis to 24 MCTs given in first semester of 2000-2001 educational year in medical faculty of Mashad University of Medical Science. Methods: Data of this descriptive study were composed of 1496 MCQs gathered from 2092 answer sheets of 24 MCTs obtained from educational department of the medical faculty. A split-half method of reliability was employed to calculate reliability coefficient for MCTs. Items Difficulty and Discrimination index also were calculated for questions. Further studies should be undertaken for developments the methods for evaluation of validity, assessment of distracters and structural principles in MCTs . Results: Mean reliability coefficient of the exams was 0.72±0.13 and In more than 50% of cases, reliability coefficient was greater than 0.7. There was a significant difference between basic science exams and clinical clerkship exams in Reliability coefficient (P=0.001). Mean standard error a/measurement (SEM) was 3.51±1.11. In 52.2% of the cases, difficulty of MCQs was inappropriate and 49.3% of questions had inadequate discriminative power to discern between poor students and good students. Conclusion: Our finding indicate that only 33% of studied MCQs have desirable or acceptable item difficulty and discrimination indices both and 34.9% of those have no desirable or acceptable item difficulty neither acceptable discrimination index. Having subjects respond reliably on a measure is a great sta11, but there is another concept needed to gel down really well named validity. Keywords: multiple choice question, test analysis, reliability, item difficulty Discrimination index http://journals.sbmu.ac.ir/jme/article/view/883multiple choice questiontest analysisreliabilityitem difficulty Discrimination index |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Maryam sadat kaveh tabatabaee Mohammad Hossein Bahreyni Toosi Akbar Derakhshan Mohammad Khajeh Dalloee Hassan Gholami |
spellingShingle |
Maryam sadat kaveh tabatabaee Mohammad Hossein Bahreyni Toosi Akbar Derakhshan Mohammad Khajeh Dalloee Hassan Gholami Analytic assessment of multiple-choice tests Journal of Medical Education multiple choice question test analysis reliability item difficulty Discrimination index |
author_facet |
Maryam sadat kaveh tabatabaee Mohammad Hossein Bahreyni Toosi Akbar Derakhshan Mohammad Khajeh Dalloee Hassan Gholami |
author_sort |
Maryam sadat kaveh tabatabaee |
title |
Analytic assessment of multiple-choice tests |
title_short |
Analytic assessment of multiple-choice tests |
title_full |
Analytic assessment of multiple-choice tests |
title_fullStr |
Analytic assessment of multiple-choice tests |
title_full_unstemmed |
Analytic assessment of multiple-choice tests |
title_sort |
analytic assessment of multiple-choice tests |
publisher |
Shaheed Beheshti University of Medical Sciences and Health Services |
series |
Journal of Medical Education |
issn |
1735-3998 1735-4005 |
publishDate |
2003-01-01 |
description |
Background: Multiple choice tests (MCT),are widely known and applied as useful evaluation tests in the field of education especially in Medical Science. Items on a multiple-choice test consist of a stem, which is followed by a correct answer as well as three to four distracters. Items on a well-written multiple-choice test will have stems that are precise and clear, one answer that is clearly correct or best, and distracters that are plausible.
Purpose: The purpose of the present study is conducting item and test analysis to 24 MCTs given in first semester of 2000-2001 educational year in medical faculty of Mashad University of Medical Science.
Methods: Data of this descriptive study were composed of 1496 MCQs gathered from 2092 answer sheets of 24 MCTs obtained from educational department of the medical faculty. A split-half method of reliability was employed to calculate reliability coefficient for MCTs. Items Difficulty and Discrimination index also were calculated for questions. Further studies should be undertaken for developments the methods for evaluation of validity, assessment of distracters and structural principles in MCTs .
Results: Mean reliability coefficient of the exams was 0.72±0.13 and In more than 50% of cases, reliability coefficient was greater than 0.7. There was a significant difference between basic science exams and clinical clerkship exams in Reliability coefficient (P=0.001). Mean standard error a/measurement (SEM) was 3.51±1.11. In 52.2% of the cases, difficulty of MCQs was inappropriate and 49.3% of questions had inadequate discriminative power to discern between poor students and good students.
Conclusion: Our finding indicate that only 33% of studied MCQs have desirable or acceptable item difficulty and discrimination indices both and 34.9% of those have no desirable or acceptable item difficulty neither acceptable discrimination index. Having subjects respond reliably on a measure is a great sta11, but there is another concept
needed to gel down really well named validity.
Keywords: multiple choice question, test analysis, reliability, item difficulty Discrimination index
|
topic |
multiple choice question test analysis reliability item difficulty Discrimination index |
url |
http://journals.sbmu.ac.ir/jme/article/view/883 |
work_keys_str_mv |
AT maryamsadatkavehtabatabaee analyticassessmentofmultiplechoicetests AT mohammadhosseinbahreynitoosi analyticassessmentofmultiplechoicetests AT akbarderakhshan analyticassessmentofmultiplechoicetests AT mohammadkhajehdalloee analyticassessmentofmultiplechoicetests AT hassangholami analyticassessmentofmultiplechoicetests |
_version_ |
1725640980848181248 |