Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI)
IntroductionAs the health system seeks to leverage large-scale data to inform population outcomes, the informatics community is developing tools for analysing these data. To support data quality assessment within such a tool, we extended the open-source software Observational Health Data Sciences an...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMJ Publishing Group
2020-07-01
|
Series: | BMJ Health & Care Informatics |
Online Access: | https://informatics.bmj.com/content/27/1/e100054.full |
id |
doaj-c60d15535af14e7fad1c08fdda145015 |
---|---|
record_format |
Article |
spelling |
doaj-c60d15535af14e7fad1c08fdda1450152020-12-14T15:13:53ZengBMJ Publishing GroupBMJ Health & Care Informatics2632-10092020-07-0127110.1136/bmjhci-2019-100054Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI)Brian E DixonChen Wen0Tony French1Jennifer L Williams2Jon D Duke3Shaun J Grannis4Center for Biomedical Informatics, Regenstrief Institute Inc, Indianapolis, Indiana, USACenter for Biomedical Informatics, Regenstrief Institute Inc, Indianapolis, Indiana, USACenter for Biomedical Informatics, Regenstrief Institute Inc, Indianapolis, Indiana, USACenter for Health Analytics and Informatics, Georgia Tech Research Institute, Atlanta, Georgia, USACenter for Biomedical Informatics, Regenstrief Institute Inc, Indianapolis, Indiana, USAIntroductionAs the health system seeks to leverage large-scale data to inform population outcomes, the informatics community is developing tools for analysing these data. To support data quality assessment within such a tool, we extended the open-source software Observational Health Data Sciences and Informatics (OHDSI) to incorporate new functions useful for population health.MethodsWe developed and tested methods to measure the completeness, timeliness and entropy of information. The new data quality methods were applied to over 100 million clinical messages received from emergency department information systems for use in public health syndromic surveillance systems.DiscussionWhile completeness and entropy methods were implemented by the OHDSI community, timeliness was not adopted as its context did not fit with the existing OHDSI domains. The case report examines the process and reasons for acceptance and rejection of ideas proposed to an open-source community like OHDSI.https://informatics.bmj.com/content/27/1/e100054.full |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Brian E Dixon Chen Wen Tony French Jennifer L Williams Jon D Duke Shaun J Grannis |
spellingShingle |
Brian E Dixon Chen Wen Tony French Jennifer L Williams Jon D Duke Shaun J Grannis Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI) BMJ Health & Care Informatics |
author_facet |
Brian E Dixon Chen Wen Tony French Jennifer L Williams Jon D Duke Shaun J Grannis |
author_sort |
Brian E Dixon |
title |
Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI) |
title_short |
Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI) |
title_full |
Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI) |
title_fullStr |
Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI) |
title_full_unstemmed |
Extending an open-source tool to measure data quality: case report on Observational Health Data Science and Informatics (OHDSI) |
title_sort |
extending an open-source tool to measure data quality: case report on observational health data science and informatics (ohdsi) |
publisher |
BMJ Publishing Group |
series |
BMJ Health & Care Informatics |
issn |
2632-1009 |
publishDate |
2020-07-01 |
description |
IntroductionAs the health system seeks to leverage large-scale data to inform population outcomes, the informatics community is developing tools for analysing these data. To support data quality assessment within such a tool, we extended the open-source software Observational Health Data Sciences and Informatics (OHDSI) to incorporate new functions useful for population health.MethodsWe developed and tested methods to measure the completeness, timeliness and entropy of information. The new data quality methods were applied to over 100 million clinical messages received from emergency department information systems for use in public health syndromic surveillance systems.DiscussionWhile completeness and entropy methods were implemented by the OHDSI community, timeliness was not adopted as its context did not fit with the existing OHDSI domains. The case report examines the process and reasons for acceptance and rejection of ideas proposed to an open-source community like OHDSI. |
url |
https://informatics.bmj.com/content/27/1/e100054.full |
work_keys_str_mv |
AT brianedixon extendinganopensourcetooltomeasuredataqualitycasereportonobservationalhealthdatascienceandinformaticsohdsi AT chenwen extendinganopensourcetooltomeasuredataqualitycasereportonobservationalhealthdatascienceandinformaticsohdsi AT tonyfrench extendinganopensourcetooltomeasuredataqualitycasereportonobservationalhealthdatascienceandinformaticsohdsi AT jenniferlwilliams extendinganopensourcetooltomeasuredataqualitycasereportonobservationalhealthdatascienceandinformaticsohdsi AT jondduke extendinganopensourcetooltomeasuredataqualitycasereportonobservationalhealthdatascienceandinformaticsohdsi AT shaunjgrannis extendinganopensourcetooltomeasuredataqualitycasereportonobservationalhealthdatascienceandinformaticsohdsi |
_version_ |
1724383508591280128 |