Accuracy of online symptom checkers and the potential impact on service utilisation.

<h4>Objectives</h4>The aims of our study are firstly to investigate the diagnostic and triage performance of symptom checkers, secondly to assess their potential impact on healthcare utilisation and thirdly to investigate for variation in performance between systems.<h4>Setting<...

Full description

Bibliographic Details
Main Authors: Adam Ceney, Stephanie Tolond, Andrzej Glowinski, Ben Marks, Simon Swift, Tom Palser
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2021-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0254088
id doaj-b2fae1df26a14587b6ef90670487a4e1
record_format Article
spelling doaj-b2fae1df26a14587b6ef90670487a4e12021-07-31T04:32:14ZengPublic Library of Science (PLoS)PLoS ONE1932-62032021-01-01167e025408810.1371/journal.pone.0254088Accuracy of online symptom checkers and the potential impact on service utilisation.Adam CeneyStephanie TolondAndrzej GlowinskiBen MarksSimon SwiftTom Palser<h4>Objectives</h4>The aims of our study are firstly to investigate the diagnostic and triage performance of symptom checkers, secondly to assess their potential impact on healthcare utilisation and thirdly to investigate for variation in performance between systems.<h4>Setting</h4>Publicly available symptom checkers for patient use.<h4>Participants</h4>Publicly available symptom-checkers were identified. A standardised set of 50 clinical vignettes were developed and systematically run through each system by a non-clinical researcher.<h4>Primary and secondary outcome measures</h4>System accuracy was assessed by measuring the percentage of times the correct diagnosis was a) listed first, b) within the top five diagnoses listed and c) listed at all. The safety of the disposition advice was assessed by comparing it with national guidelines for each vignette.<h4>Results</h4>Twelve tools were identified and included. Mean diagnostic accuracy of the systems was poor, with the correct diagnosis being present in the top five diagnoses on 51.0% (Range 22.2 to 84.0%). Safety of disposition advice decreased with condition urgency (being 71.8% for emergency cases vs 87.3% for non-urgent cases). 51.0% of systems suggested additional resource utilisation above that recommended by national guidelines (range 18.0% to 61.2%). Both diagnostic accuracy and appropriate resource recommendation varied substantially between systems.<h4>Conclusions</h4>There is wide variation in performance between available symptom checkers and overall performance is significantly below what would be accepted in any other medical field, though some do achieve a good level of accuracy and safety of disposition. External validation and regulation are urgently required to ensure these public facing tools are safe.https://doi.org/10.1371/journal.pone.0254088
collection DOAJ
language English
format Article
sources DOAJ
author Adam Ceney
Stephanie Tolond
Andrzej Glowinski
Ben Marks
Simon Swift
Tom Palser
spellingShingle Adam Ceney
Stephanie Tolond
Andrzej Glowinski
Ben Marks
Simon Swift
Tom Palser
Accuracy of online symptom checkers and the potential impact on service utilisation.
PLoS ONE
author_facet Adam Ceney
Stephanie Tolond
Andrzej Glowinski
Ben Marks
Simon Swift
Tom Palser
author_sort Adam Ceney
title Accuracy of online symptom checkers and the potential impact on service utilisation.
title_short Accuracy of online symptom checkers and the potential impact on service utilisation.
title_full Accuracy of online symptom checkers and the potential impact on service utilisation.
title_fullStr Accuracy of online symptom checkers and the potential impact on service utilisation.
title_full_unstemmed Accuracy of online symptom checkers and the potential impact on service utilisation.
title_sort accuracy of online symptom checkers and the potential impact on service utilisation.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2021-01-01
description <h4>Objectives</h4>The aims of our study are firstly to investigate the diagnostic and triage performance of symptom checkers, secondly to assess their potential impact on healthcare utilisation and thirdly to investigate for variation in performance between systems.<h4>Setting</h4>Publicly available symptom checkers for patient use.<h4>Participants</h4>Publicly available symptom-checkers were identified. A standardised set of 50 clinical vignettes were developed and systematically run through each system by a non-clinical researcher.<h4>Primary and secondary outcome measures</h4>System accuracy was assessed by measuring the percentage of times the correct diagnosis was a) listed first, b) within the top five diagnoses listed and c) listed at all. The safety of the disposition advice was assessed by comparing it with national guidelines for each vignette.<h4>Results</h4>Twelve tools were identified and included. Mean diagnostic accuracy of the systems was poor, with the correct diagnosis being present in the top five diagnoses on 51.0% (Range 22.2 to 84.0%). Safety of disposition advice decreased with condition urgency (being 71.8% for emergency cases vs 87.3% for non-urgent cases). 51.0% of systems suggested additional resource utilisation above that recommended by national guidelines (range 18.0% to 61.2%). Both diagnostic accuracy and appropriate resource recommendation varied substantially between systems.<h4>Conclusions</h4>There is wide variation in performance between available symptom checkers and overall performance is significantly below what would be accepted in any other medical field, though some do achieve a good level of accuracy and safety of disposition. External validation and regulation are urgently required to ensure these public facing tools are safe.
url https://doi.org/10.1371/journal.pone.0254088
work_keys_str_mv AT adamceney accuracyofonlinesymptomcheckersandthepotentialimpactonserviceutilisation
AT stephanietolond accuracyofonlinesymptomcheckersandthepotentialimpactonserviceutilisation
AT andrzejglowinski accuracyofonlinesymptomcheckersandthepotentialimpactonserviceutilisation
AT benmarks accuracyofonlinesymptomcheckersandthepotentialimpactonserviceutilisation
AT simonswift accuracyofonlinesymptomcheckersandthepotentialimpactonserviceutilisation
AT tompalser accuracyofonlinesymptomcheckersandthepotentialimpactonserviceutilisation
_version_ 1721247170999353344