Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy
Purpose: The Objective Structured Clinical Examination (OSCE) is a widely employed tool for measuring clinical competence. In the drive toward comprehensive assessment, OSCE stations and checklists may become increasingly complex. The objective of this study was to probe inter-observer reliability a...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Taylor & Francis Group
2015-10-01
|
Series: | Medical Education Online |
Subjects: | |
Online Access: | http://med-ed-online.net/index.php/meo/article/view/29242/pdf_84 |
id |
doaj-ba37a32e39dd4a2dbce69f178e380c2d |
---|---|
record_format |
Article |
spelling |
doaj-ba37a32e39dd4a2dbce69f178e380c2d2020-11-24T22:02:23ZengTaylor & Francis GroupMedical Education Online1087-29812015-10-012001510.3402/meo.v20.2924229242Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracyKatrina F. Hurley0Nick A. Giffin1Samuel A. Stewart2Graham B. Bullock3 Department of Emergency Medicine, Dalhousie University, Halifax, NS, Canada Bachelor of Medicine Class of 2016, Dalhousie University, Halifax, NS, Canada 3Division of Medical Informatics, Dalhousie University, Halifax, NS, Canada Department of Emergency Medicine, Dalhousie University, Halifax, NS, CanadaPurpose: The Objective Structured Clinical Examination (OSCE) is a widely employed tool for measuring clinical competence. In the drive toward comprehensive assessment, OSCE stations and checklists may become increasingly complex. The objective of this study was to probe inter-observer reliability and observer accuracy as a function of OSCE checklist length. Method: Study participants included emergency physicians and senior residents in Emergency Medicine at Dalhousie University. Participants watched an identical series of four, scripted, standardized videos enacting 10-min OSCE stations and completed corresponding assessment checklists. Each participating observer was provided with a random combination of two 40-item and two 20-item checklists. A panel of physicians scored the scenarios through repeated video review to determine the ‘gold standard’ checklist scores. Results: Fifty-seven observers completed 228 assessment checklists. Mean observer accuracy ranged from 73 to 93% (14.6–18.7/20), with an overall accuracy of 86% (17.2/20), and inter-rater reliability range of 58–78%. After controlling for station and individual variation, no effect was observed regarding the number of checklist items on overall accuracy (p=0.2305). Consistency in ratings was calculated using intraclass correlation coefficient and demonstrated no significant difference in consistency between the 20- and 40-item checklists (ranged from 0.432 to 0.781, p-values from 0.56 to 0.73). Conclusions: The addition of 20 checklist items to a core list of 20 items in an OSCE assessment checklist does not appear to impact observer accuracy or inter-rater reliability.http://med-ed-online.net/index.php/meo/article/view/29242/pdf_84medical educationassessmentOSCEreliabilitymetrics |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Katrina F. Hurley Nick A. Giffin Samuel A. Stewart Graham B. Bullock |
spellingShingle |
Katrina F. Hurley Nick A. Giffin Samuel A. Stewart Graham B. Bullock Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy Medical Education Online medical education assessment OSCE reliability metrics |
author_facet |
Katrina F. Hurley Nick A. Giffin Samuel A. Stewart Graham B. Bullock |
author_sort |
Katrina F. Hurley |
title |
Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy |
title_short |
Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy |
title_full |
Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy |
title_fullStr |
Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy |
title_full_unstemmed |
Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy |
title_sort |
probing the effect of osce checklist length on inter-observer reliability and observer accuracy |
publisher |
Taylor & Francis Group |
series |
Medical Education Online |
issn |
1087-2981 |
publishDate |
2015-10-01 |
description |
Purpose: The Objective Structured Clinical Examination (OSCE) is a widely employed tool for measuring clinical competence. In the drive toward comprehensive assessment, OSCE stations and checklists may become increasingly complex. The objective of this study was to probe inter-observer reliability and observer accuracy as a function of OSCE checklist length. Method: Study participants included emergency physicians and senior residents in Emergency Medicine at Dalhousie University. Participants watched an identical series of four, scripted, standardized videos enacting 10-min OSCE stations and completed corresponding assessment checklists. Each participating observer was provided with a random combination of two 40-item and two 20-item checklists. A panel of physicians scored the scenarios through repeated video review to determine the ‘gold standard’ checklist scores. Results: Fifty-seven observers completed 228 assessment checklists. Mean observer accuracy ranged from 73 to 93% (14.6–18.7/20), with an overall accuracy of 86% (17.2/20), and inter-rater reliability range of 58–78%. After controlling for station and individual variation, no effect was observed regarding the number of checklist items on overall accuracy (p=0.2305). Consistency in ratings was calculated using intraclass correlation coefficient and demonstrated no significant difference in consistency between the 20- and 40-item checklists (ranged from 0.432 to 0.781, p-values from 0.56 to 0.73). Conclusions: The addition of 20 checklist items to a core list of 20 items in an OSCE assessment checklist does not appear to impact observer accuracy or inter-rater reliability. |
topic |
medical education assessment OSCE reliability metrics |
url |
http://med-ed-online.net/index.php/meo/article/view/29242/pdf_84 |
work_keys_str_mv |
AT katrinafhurley probingtheeffectofoscechecklistlengthoninterobserverreliabilityandobserveraccuracy AT nickagiffin probingtheeffectofoscechecklistlengthoninterobserverreliabilityandobserveraccuracy AT samuelastewart probingtheeffectofoscechecklistlengthoninterobserverreliabilityandobserveraccuracy AT grahambbullock probingtheeffectofoscechecklistlengthoninterobserverreliabilityandobserveraccuracy |
_version_ |
1725836069871550464 |