Barriers to Working With National Health Service England’s Open Data

Open data is information made freely available to third parties in structured formats without restrictive licensing conditions, permitting commercial and noncommercial organizations to innovate. In the context of National Health Service (NHS) data, this is intended to improve patient outc...

Full description

Bibliographic Details
Main Authors: Bacon, Seb, Goldacre, Ben
Format: Article
Language:English
Published: JMIR Publications 2020-01-01
Series:Journal of Medical Internet Research
Online Access:https://www.jmir.org/2020/1/e15603
id doaj-20e61f0134f049e8b336b40818be429f
record_format Article
spelling doaj-20e61f0134f049e8b336b40818be429f2021-04-02T18:41:04ZengJMIR PublicationsJournal of Medical Internet Research1438-88712020-01-01221e1560310.2196/15603Barriers to Working With National Health Service England’s Open DataBacon, SebGoldacre, Ben Open data is information made freely available to third parties in structured formats without restrictive licensing conditions, permitting commercial and noncommercial organizations to innovate. In the context of National Health Service (NHS) data, this is intended to improve patient outcomes and efficiency. EBM DataLab is a research group with a focus on online tools which turn our research findings into actionable monthly outputs. We regularly import and process more than 15 different NHS open datasets to deliver OpenPrescribing.net, one of the most high-impact use cases for NHS England’s open data, with over 15,000 unique users each month. In this paper, we have described the many breaches of best practices around NHS open data that we have encountered. Examples include datasets that repeatedly change location without warning or forwarding; datasets that are needlessly behind a “CAPTCHA” and so cannot be automatically downloaded; longitudinal datasets that change their structure without warning or documentation; near-duplicate datasets with unexplained differences; datasets that are impossible to locate, and thus may or may not exist; poor or absent documentation; and withholding of data for dubious reasons. We propose new open ways of working that will support better analytics for all users of the NHS. These include better curation, better documentation, and systems for better dialogue with technical teams.https://www.jmir.org/2020/1/e15603
collection DOAJ
language English
format Article
sources DOAJ
author Bacon, Seb
Goldacre, Ben
spellingShingle Bacon, Seb
Goldacre, Ben
Barriers to Working With National Health Service England’s Open Data
Journal of Medical Internet Research
author_facet Bacon, Seb
Goldacre, Ben
author_sort Bacon, Seb
title Barriers to Working With National Health Service England’s Open Data
title_short Barriers to Working With National Health Service England’s Open Data
title_full Barriers to Working With National Health Service England’s Open Data
title_fullStr Barriers to Working With National Health Service England’s Open Data
title_full_unstemmed Barriers to Working With National Health Service England’s Open Data
title_sort barriers to working with national health service england’s open data
publisher JMIR Publications
series Journal of Medical Internet Research
issn 1438-8871
publishDate 2020-01-01
description Open data is information made freely available to third parties in structured formats without restrictive licensing conditions, permitting commercial and noncommercial organizations to innovate. In the context of National Health Service (NHS) data, this is intended to improve patient outcomes and efficiency. EBM DataLab is a research group with a focus on online tools which turn our research findings into actionable monthly outputs. We regularly import and process more than 15 different NHS open datasets to deliver OpenPrescribing.net, one of the most high-impact use cases for NHS England’s open data, with over 15,000 unique users each month. In this paper, we have described the many breaches of best practices around NHS open data that we have encountered. Examples include datasets that repeatedly change location without warning or forwarding; datasets that are needlessly behind a “CAPTCHA” and so cannot be automatically downloaded; longitudinal datasets that change their structure without warning or documentation; near-duplicate datasets with unexplained differences; datasets that are impossible to locate, and thus may or may not exist; poor or absent documentation; and withholding of data for dubious reasons. We propose new open ways of working that will support better analytics for all users of the NHS. These include better curation, better documentation, and systems for better dialogue with technical teams.
url https://www.jmir.org/2020/1/e15603
work_keys_str_mv AT baconseb barrierstoworkingwithnationalhealthserviceenglandsopendata
AT goldacreben barrierstoworkingwithnationalhealthserviceenglandsopendata
_version_ 1721551125086208000