Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy

Purpose: The Objective Structured Clinical Examination (OSCE) is a widely employed tool for measuring clinical competence. In the drive toward comprehensive assessment, OSCE stations and checklists may become increasingly complex. The objective of this study was to probe inter-observer reliability a...

Full description

Bibliographic Details
Main Authors:	Katrina F. Hurley, Nick A. Giffin, Samuel A. Stewart, Graham B. Bullock
Format:	Article
Language:	English
Published:	Taylor & Francis Group 2015-10-01
Series:	Medical Education Online
Subjects:	medical education assessment OSCE reliability metrics
Online Access:	http://med-ed-online.net/index.php/meo/article/view/29242/pdf_84

id	doaj-ba37a32e39dd4a2dbce69f178e380c2d
record_format	Article
spelling	doaj-ba37a32e39dd4a2dbce69f178e380c2d2020-11-24T22:02:23ZengTaylor & Francis GroupMedical Education Online1087-29812015-10-012001510.3402/meo.v20.2924229242Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracyKatrina F. Hurley0Nick A. Giffin1Samuel A. Stewart2Graham B. Bullock3 Department of Emergency Medicine, Dalhousie University, Halifax, NS, Canada Bachelor of Medicine Class of 2016, Dalhousie University, Halifax, NS, Canada 3Division of Medical Informatics, Dalhousie University, Halifax, NS, Canada Department of Emergency Medicine, Dalhousie University, Halifax, NS, CanadaPurpose: The Objective Structured Clinical Examination (OSCE) is a widely employed tool for measuring clinical competence. In the drive toward comprehensive assessment, OSCE stations and checklists may become increasingly complex. The objective of this study was to probe inter-observer reliability and observer accuracy as a function of OSCE checklist length. Method: Study participants included emergency physicians and senior residents in Emergency Medicine at Dalhousie University. Participants watched an identical series of four, scripted, standardized videos enacting 10-min OSCE stations and completed corresponding assessment checklists. Each participating observer was provided with a random combination of two 40-item and two 20-item checklists. A panel of physicians scored the scenarios through repeated video review to determine the ‘gold standard’ checklist scores. Results: Fifty-seven observers completed 228 assessment checklists. Mean observer accuracy ranged from 73 to 93% (14.6–18.7/20), with an overall accuracy of 86% (17.2/20), and inter-rater reliability range of 58–78%. After controlling for station and individual variation, no effect was observed regarding the number of checklist items on overall accuracy (p=0.2305). Consistency in ratings was calculated using intraclass correlation coefficient and demonstrated no significant difference in consistency between the 20- and 40-item checklists (ranged from 0.432 to 0.781, p-values from 0.56 to 0.73). Conclusions: The addition of 20 checklist items to a core list of 20 items in an OSCE assessment checklist does not appear to impact observer accuracy or inter-rater reliability.http://med-ed-online.net/index.php/meo/article/view/29242/pdf_84medical educationassessmentOSCEreliabilitymetrics
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Katrina F. Hurley Nick A. Giffin Samuel A. Stewart Graham B. Bullock
spellingShingle	Katrina F. Hurley Nick A. Giffin Samuel A. Stewart Graham B. Bullock Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy Medical Education Online medical education assessment OSCE reliability metrics
author_facet	Katrina F. Hurley Nick A. Giffin Samuel A. Stewart Graham B. Bullock
author_sort	Katrina F. Hurley
title	Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy
title_short	Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy
title_full	Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy
title_fullStr	Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy
title_full_unstemmed	Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy
title_sort	probing the effect of osce checklist length on inter-observer reliability and observer accuracy
publisher	Taylor & Francis Group
series	Medical Education Online
issn	1087-2981
publishDate	2015-10-01
description	Purpose: The Objective Structured Clinical Examination (OSCE) is a widely employed tool for measuring clinical competence. In the drive toward comprehensive assessment, OSCE stations and checklists may become increasingly complex. The objective of this study was to probe inter-observer reliability and observer accuracy as a function of OSCE checklist length. Method: Study participants included emergency physicians and senior residents in Emergency Medicine at Dalhousie University. Participants watched an identical series of four, scripted, standardized videos enacting 10-min OSCE stations and completed corresponding assessment checklists. Each participating observer was provided with a random combination of two 40-item and two 20-item checklists. A panel of physicians scored the scenarios through repeated video review to determine the ‘gold standard’ checklist scores. Results: Fifty-seven observers completed 228 assessment checklists. Mean observer accuracy ranged from 73 to 93% (14.6–18.7/20), with an overall accuracy of 86% (17.2/20), and inter-rater reliability range of 58–78%. After controlling for station and individual variation, no effect was observed regarding the number of checklist items on overall accuracy (p=0.2305). Consistency in ratings was calculated using intraclass correlation coefficient and demonstrated no significant difference in consistency between the 20- and 40-item checklists (ranged from 0.432 to 0.781, p-values from 0.56 to 0.73). Conclusions: The addition of 20 checklist items to a core list of 20 items in an OSCE assessment checklist does not appear to impact observer accuracy or inter-rater reliability.
topic	medical education assessment OSCE reliability metrics
url	http://med-ed-online.net/index.php/meo/article/view/29242/pdf_84
work_keys_str_mv	AT katrinafhurley probingtheeffectofoscechecklistlengthoninterobserverreliabilityandobserveraccuracy AT nickagiffin probingtheeffectofoscechecklistlengthoninterobserverreliabilityandobserveraccuracy AT samuelastewart probingtheeffectofoscechecklistlengthoninterobserverreliabilityandobserveraccuracy AT grahambbullock probingtheeffectofoscechecklistlengthoninterobserverreliabilityandobserveraccuracy
_version_	1725836069871550464

Probing the effect of OSCE checklist length on inter-observer reliability and observer accuracy

Similar Items