Approaches to describing inter-rater reliability of the overall clinical appearance of febrile infants and toddlers in the emergency department

Objectives. To measure inter-rater agreement of overall clinical appearance of febrile children aged less than 24 months and to compare methods for doing so.Study Design and Setting. We performed an observational study of inter-rater reliability of the assessment of febrile children in a county hosp...

Full description

Bibliographic Details
Main Authors: Paul Walsh, Justin Thornton, Julie Asato, Nicholas Walker, Gary McCoy, Joe Baal, Jed Baal, Nanse Mendoza, Faried Banimahd
Format: Article
Language:English
Published: PeerJ Inc. 2014-11-01
Series:PeerJ
Subjects:
Online Access:https://peerj.com/articles/651.pdf
id doaj-3badf6dba4b64d43be904002401c957a
record_format Article
spelling doaj-3badf6dba4b64d43be904002401c957a2020-11-24T23:19:49ZengPeerJ Inc.PeerJ2167-83592014-11-012e65110.7717/peerj.651651Approaches to describing inter-rater reliability of the overall clinical appearance of febrile infants and toddlers in the emergency departmentPaul Walsh0Justin Thornton1Julie Asato2Nicholas Walker3Gary McCoy4Joe Baal5Jed Baal6Nanse Mendoza7Faried Banimahd8Department of Emergency Medicine, University of California Davis Medical Center, Sacramento, CA, USADepartment of Emergency Medicine, Kern Medical Center, Bakersfield, CA, USADepartment of Emergency Medicine, Kern Medical Center, Bakersfield, CA, USADepartment of Emergency Medicine, Kern Medical Center, Bakersfield, CA, USADepartment of Emergency Medicine, Kern Medical Center, Bakersfield, CA, USADepartment of Emergency Medicine, Kern Medical Center, Bakersfield, CA, USADepartment of Emergency Medicine, Kern Medical Center, Bakersfield, CA, USADepartment of Emergency Medicine, Kern Medical Center, Bakersfield, CA, USADepartment of Emergency Medicine, University of California Irvine, Orange, CA, USAObjectives. To measure inter-rater agreement of overall clinical appearance of febrile children aged less than 24 months and to compare methods for doing so.Study Design and Setting. We performed an observational study of inter-rater reliability of the assessment of febrile children in a county hospital emergency department serving a mixed urban and rural population. Two emergency medicine healthcare providers independently evaluated the overall clinical appearance of children less than 24 months of age who had presented for fever. They recorded the initial ‘gestalt’ assessment of whether or not the child was ill appearing or if they were unsure. They then repeated this assessment after examining the child. Each rater was blinded to the other’s assessment. Our primary analysis was graphical. We also calculated Cohen’s κ, Gwet’s agreement coefficient and other measures of agreement and weighted variants of these. We examined the effect of time between exams and patient and provider characteristics on inter-rater agreement.Results. We analyzed 159 of the 173 patients enrolled. Median age was 9.5 months (lower and upper quartiles 4.9–14.6), 99/159 (62%) were boys and 22/159 (14%) were admitted. Overall 118/159 (74%) and 119/159 (75%) were classified as well appearing on initial ‘gestalt’ impression by both examiners. Summary statistics varied from 0.223 for weighted κ to 0.635 for Gwet’s AC2. Inter rater agreement was affected by the time interval between the evaluations and the age of the child but not by the experience levels of the rater pairs. Classifications of ‘not ill appearing’ were more reliable than others.Conclusion. The inter-rater reliability of emergency providers’ assessment of overall clinical appearance was adequate when described graphically and by Gwet’s AC. Different summary statistics yield different results for the same dataset.https://peerj.com/articles/651.pdfGwet’s ACInter-rater agreementCohen’s kappaGraphical analysisEmergency medicinePediatric
collection DOAJ
language English
format Article
sources DOAJ
author Paul Walsh
Justin Thornton
Julie Asato
Nicholas Walker
Gary McCoy
Joe Baal
Jed Baal
Nanse Mendoza
Faried Banimahd
spellingShingle Paul Walsh
Justin Thornton
Julie Asato
Nicholas Walker
Gary McCoy
Joe Baal
Jed Baal
Nanse Mendoza
Faried Banimahd
Approaches to describing inter-rater reliability of the overall clinical appearance of febrile infants and toddlers in the emergency department
PeerJ
Gwet’s AC
Inter-rater agreement
Cohen’s kappa
Graphical analysis
Emergency medicine
Pediatric
author_facet Paul Walsh
Justin Thornton
Julie Asato
Nicholas Walker
Gary McCoy
Joe Baal
Jed Baal
Nanse Mendoza
Faried Banimahd
author_sort Paul Walsh
title Approaches to describing inter-rater reliability of the overall clinical appearance of febrile infants and toddlers in the emergency department
title_short Approaches to describing inter-rater reliability of the overall clinical appearance of febrile infants and toddlers in the emergency department
title_full Approaches to describing inter-rater reliability of the overall clinical appearance of febrile infants and toddlers in the emergency department
title_fullStr Approaches to describing inter-rater reliability of the overall clinical appearance of febrile infants and toddlers in the emergency department
title_full_unstemmed Approaches to describing inter-rater reliability of the overall clinical appearance of febrile infants and toddlers in the emergency department
title_sort approaches to describing inter-rater reliability of the overall clinical appearance of febrile infants and toddlers in the emergency department
publisher PeerJ Inc.
series PeerJ
issn 2167-8359
publishDate 2014-11-01
description Objectives. To measure inter-rater agreement of overall clinical appearance of febrile children aged less than 24 months and to compare methods for doing so.Study Design and Setting. We performed an observational study of inter-rater reliability of the assessment of febrile children in a county hospital emergency department serving a mixed urban and rural population. Two emergency medicine healthcare providers independently evaluated the overall clinical appearance of children less than 24 months of age who had presented for fever. They recorded the initial ‘gestalt’ assessment of whether or not the child was ill appearing or if they were unsure. They then repeated this assessment after examining the child. Each rater was blinded to the other’s assessment. Our primary analysis was graphical. We also calculated Cohen’s κ, Gwet’s agreement coefficient and other measures of agreement and weighted variants of these. We examined the effect of time between exams and patient and provider characteristics on inter-rater agreement.Results. We analyzed 159 of the 173 patients enrolled. Median age was 9.5 months (lower and upper quartiles 4.9–14.6), 99/159 (62%) were boys and 22/159 (14%) were admitted. Overall 118/159 (74%) and 119/159 (75%) were classified as well appearing on initial ‘gestalt’ impression by both examiners. Summary statistics varied from 0.223 for weighted κ to 0.635 for Gwet’s AC2. Inter rater agreement was affected by the time interval between the evaluations and the age of the child but not by the experience levels of the rater pairs. Classifications of ‘not ill appearing’ were more reliable than others.Conclusion. The inter-rater reliability of emergency providers’ assessment of overall clinical appearance was adequate when described graphically and by Gwet’s AC. Different summary statistics yield different results for the same dataset.
topic Gwet’s AC
Inter-rater agreement
Cohen’s kappa
Graphical analysis
Emergency medicine
Pediatric
url https://peerj.com/articles/651.pdf
work_keys_str_mv AT paulwalsh approachestodescribinginterraterreliabilityoftheoverallclinicalappearanceoffebrileinfantsandtoddlersintheemergencydepartment
AT justinthornton approachestodescribinginterraterreliabilityoftheoverallclinicalappearanceoffebrileinfantsandtoddlersintheemergencydepartment
AT julieasato approachestodescribinginterraterreliabilityoftheoverallclinicalappearanceoffebrileinfantsandtoddlersintheemergencydepartment
AT nicholaswalker approachestodescribinginterraterreliabilityoftheoverallclinicalappearanceoffebrileinfantsandtoddlersintheemergencydepartment
AT garymccoy approachestodescribinginterraterreliabilityoftheoverallclinicalappearanceoffebrileinfantsandtoddlersintheemergencydepartment
AT joebaal approachestodescribinginterraterreliabilityoftheoverallclinicalappearanceoffebrileinfantsandtoddlersintheemergencydepartment
AT jedbaal approachestodescribinginterraterreliabilityoftheoverallclinicalappearanceoffebrileinfantsandtoddlersintheemergencydepartment
AT nansemendoza approachestodescribinginterraterreliabilityoftheoverallclinicalappearanceoffebrileinfantsandtoddlersintheemergencydepartment
AT fariedbanimahd approachestodescribinginterraterreliabilityoftheoverallclinicalappearanceoffebrileinfantsandtoddlersintheemergencydepartment
_version_ 1725576662093922304