A tool to improve the efficiency and reproducibility of research using electronic health record databases
Background Interrogation of electronic health record databases often involves time-consuming, manual, repetitive work in developing database queries. We developed a tool to automate this process. Methods We identified elementary approaches to query primary care data from the Secure Anonymised In...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Swansea University
2018-06-01
|
Series: | International Journal of Population Data Science |
Online Access: | https://ijpds.org/article/view/540 |
id |
doaj-f0ce2f57ca81427d8c1d7ed95c24196d |
---|---|
record_format |
Article |
spelling |
doaj-f0ce2f57ca81427d8c1d7ed95c24196d2020-11-24T21:39:26ZengSwansea UniversityInternational Journal of Population Data Science2399-49082018-06-013210.23889/ijpds.v3i2.540540A tool to improve the efficiency and reproducibility of research using electronic health record databasesMohammad Al Sallakh0Sarah RodgersRonan Lyons1Aziz Sheikh2Gwyneth Davies3Swansea University Medical SchoolSwansea UniversityUsher Institute of Population Health Sciences and Informatics, The University of EdinburghSwansea University Medical School Background Interrogation of electronic health record databases often involves time-consuming, manual, repetitive work in developing database queries. We developed a tool to automate this process. Methods We identified elementary approaches to query primary care data from the Secure Anonymised Information Linkage databank of Wales. We designed a web-based query builder that allows using combinations of these approaches as ‘building blocks’ to query complex variables. We created an R programme to automatically generate and execute the corresponding Structured Query Language queries. Results The tool allows data extraction using combinations of the following methods: event count (e.g., asthma prescriptions); code/date of earliest/latest event; code/date/value of the event of maximum/minimum value; and frequency of temporally constrained events. Query intervals could be fixed, dynamic, or individualised. The tool integrates with a codeset repository. Data extraction procedures and codesets are saved on a web server as versioned, shareable, and citable objects. Conclusion This versatile tool allows rapid and complex data extraction with minimal to no programming skills, reduces human errors, and improves research transparency and reproducibility. Funding/Support Health and Care Research Wales, ABMU Health Board, AUKCAR (AUK-AC-2012-01), Farr Institute of Health Informatics Research (MR/K006525/1-MR/K007017/1). https://ijpds.org/article/view/540 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Mohammad Al Sallakh Sarah Rodgers Ronan Lyons Aziz Sheikh Gwyneth Davies |
spellingShingle |
Mohammad Al Sallakh Sarah Rodgers Ronan Lyons Aziz Sheikh Gwyneth Davies A tool to improve the efficiency and reproducibility of research using electronic health record databases International Journal of Population Data Science |
author_facet |
Mohammad Al Sallakh Sarah Rodgers Ronan Lyons Aziz Sheikh Gwyneth Davies |
author_sort |
Mohammad Al Sallakh |
title |
A tool to improve the efficiency and reproducibility of research using electronic health record databases |
title_short |
A tool to improve the efficiency and reproducibility of research using electronic health record databases |
title_full |
A tool to improve the efficiency and reproducibility of research using electronic health record databases |
title_fullStr |
A tool to improve the efficiency and reproducibility of research using electronic health record databases |
title_full_unstemmed |
A tool to improve the efficiency and reproducibility of research using electronic health record databases |
title_sort |
tool to improve the efficiency and reproducibility of research using electronic health record databases |
publisher |
Swansea University |
series |
International Journal of Population Data Science |
issn |
2399-4908 |
publishDate |
2018-06-01 |
description |
Background
Interrogation of electronic health record databases often involves time-consuming, manual, repetitive work in developing database queries. We developed a tool to automate this process.
Methods
We identified elementary approaches to query primary care data from the Secure Anonymised Information Linkage databank of Wales. We designed a web-based query builder that allows using combinations of these approaches as ‘building blocks’ to query complex variables. We created an R programme to automatically generate and execute the corresponding Structured Query Language queries.
Results
The tool allows data extraction using combinations of the following methods: event count (e.g., asthma prescriptions); code/date of earliest/latest event; code/date/value of the event of maximum/minimum value; and frequency of temporally constrained events. Query intervals could be fixed, dynamic, or individualised. The tool integrates with a codeset repository. Data extraction procedures and codesets are saved on a web server as versioned, shareable, and citable objects.
Conclusion
This versatile tool allows rapid and complex data extraction with minimal to no programming skills, reduces human errors, and improves research transparency and reproducibility.
Funding/Support
Health and Care Research Wales, ABMU Health Board, AUKCAR (AUK-AC-2012-01), Farr Institute of Health Informatics Research (MR/K006525/1-MR/K007017/1).
|
url |
https://ijpds.org/article/view/540 |
work_keys_str_mv |
AT mohammadalsallakh atooltoimprovetheefficiencyandreproducibilityofresearchusingelectronichealthrecorddatabases AT sarahrodgers atooltoimprovetheefficiencyandreproducibilityofresearchusingelectronichealthrecorddatabases AT ronanlyons atooltoimprovetheefficiencyandreproducibilityofresearchusingelectronichealthrecorddatabases AT azizsheikh atooltoimprovetheefficiencyandreproducibilityofresearchusingelectronichealthrecorddatabases AT gwynethdavies atooltoimprovetheefficiencyandreproducibilityofresearchusingelectronichealthrecorddatabases AT mohammadalsallakh tooltoimprovetheefficiencyandreproducibilityofresearchusingelectronichealthrecorddatabases AT sarahrodgers tooltoimprovetheefficiencyandreproducibilityofresearchusingelectronichealthrecorddatabases AT ronanlyons tooltoimprovetheefficiencyandreproducibilityofresearchusingelectronichealthrecorddatabases AT azizsheikh tooltoimprovetheefficiencyandreproducibilityofresearchusingelectronichealthrecorddatabases AT gwynethdavies tooltoimprovetheefficiencyandreproducibilityofresearchusingelectronichealthrecorddatabases |
_version_ |
1725931467598462976 |