Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI)

IntroductionAs the health system seeks to leverage large-scale data to inform population outcomes, the informatics community is developing tools for analysing these data. To support data quality assessment within such a tool, we extended the open-source software Observational Health Data Sciences an...

Full description

Bibliographic Details
Main Authors: Brian E Dixon, Chen Wen, Tony French, Jennifer L Williams, Jon D Duke, Shaun J Grannis
Format: Article
Language:English
Published: BMJ Publishing Group 2020-07-01
Series:BMJ Health & Care Informatics
Online Access:https://informatics.bmj.com/content/27/1/e100054.full
id doaj-c60d15535af14e7fad1c08fdda145015
record_format Article
spelling doaj-c60d15535af14e7fad1c08fdda1450152020-12-14T15:13:53ZengBMJ Publishing GroupBMJ Health & Care Informatics2632-10092020-07-0127110.1136/bmjhci-2019-100054Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI)Brian E DixonChen Wen0Tony French1Jennifer L Williams2Jon D Duke3Shaun J Grannis4Center for Biomedical Informatics, Regenstrief Institute Inc, Indianapolis, Indiana, USACenter for Biomedical Informatics, Regenstrief Institute Inc, Indianapolis, Indiana, USACenter for Biomedical Informatics, Regenstrief Institute Inc, Indianapolis, Indiana, USACenter for Health Analytics and Informatics, Georgia Tech Research Institute, Atlanta, Georgia, USACenter for Biomedical Informatics, Regenstrief Institute Inc, Indianapolis, Indiana, USAIntroductionAs the health system seeks to leverage large-scale data to inform population outcomes, the informatics community is developing tools for analysing these data. To support data quality assessment within such a tool, we extended the open-source software Observational Health Data Sciences and Informatics (OHDSI) to incorporate new functions useful for population health.MethodsWe developed and tested methods to measure the completeness, timeliness and entropy of information. The new data quality methods were applied to over 100 million clinical messages received from emergency department information systems for use in public health syndromic surveillance systems.DiscussionWhile completeness and entropy methods were implemented by the OHDSI community, timeliness was not adopted as its context did not fit with the existing OHDSI domains. The case report examines the process and reasons for acceptance and rejection of ideas proposed to an open-source community like OHDSI.https://informatics.bmj.com/content/27/1/e100054.full
collection DOAJ
language English
format Article
sources DOAJ
author Brian E Dixon
Chen Wen
Tony French
Jennifer L Williams
Jon D Duke
Shaun J Grannis
spellingShingle Brian E Dixon
Chen Wen
Tony French
Jennifer L Williams
Jon D Duke
Shaun J Grannis
Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI)
BMJ Health & Care Informatics
author_facet Brian E Dixon
Chen Wen
Tony French
Jennifer L Williams
Jon D Duke
Shaun J Grannis
author_sort Brian E Dixon
title Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI)
title_short Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI)
title_full Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI)
title_fullStr Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI)
title_full_unstemmed Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI)
title_sort extending an open-source tool to measure data quality: case report on observational health data science and informatics (ohdsi)
publisher BMJ Publishing Group
series BMJ Health & Care Informatics
issn 2632-1009
publishDate 2020-07-01
description IntroductionAs the health system seeks to leverage large-scale data to inform population outcomes, the informatics community is developing tools for analysing these data. To support data quality assessment within such a tool, we extended the open-source software Observational Health Data Sciences and Informatics (OHDSI) to incorporate new functions useful for population health.MethodsWe developed and tested methods to measure the completeness, timeliness and entropy of information. The new data quality methods were applied to over 100 million clinical messages received from emergency department information systems for use in public health syndromic surveillance systems.DiscussionWhile completeness and entropy methods were implemented by the OHDSI community, timeliness was not adopted as its context did not fit with the existing OHDSI domains. The case report examines the process and reasons for acceptance and rejection of ideas proposed to an open-source community like OHDSI.
url https://informatics.bmj.com/content/27/1/e100054.full
work_keys_str_mv AT brianedixon extendinganopensourcetooltomeasuredataqualitycasereportonobservationalhealthdatascienceandinformaticsohdsi
AT chenwen extendinganopensourcetooltomeasuredataqualitycasereportonobservationalhealthdatascienceandinformaticsohdsi
AT tonyfrench extendinganopensourcetooltomeasuredataqualitycasereportonobservationalhealthdatascienceandinformaticsohdsi
AT jenniferlwilliams extendinganopensourcetooltomeasuredataqualitycasereportonobservationalhealthdatascienceandinformaticsohdsi
AT jondduke extendinganopensourcetooltomeasuredataqualitycasereportonobservationalhealthdatascienceandinformaticsohdsi
AT shaunjgrannis extendinganopensourcetooltomeasuredataqualitycasereportonobservationalhealthdatascienceandinformaticsohdsi
_version_ 1724383508591280128