The German Corona Consensus Dataset (GECCO): a standardized dataset for COVID-19 research in university medicine and beyond

Abstract Background The current COVID-19 pandemic has led to a surge of research activity. While this research provides important insights, the multitude of studies results in an increasing fragmentation of information. To ensure comparability across projects and institutions, standard datasets are...

Full description

Bibliographic Details
Main Authors: Julian Sass, Alexander Bartschke, Moritz Lehne, Andrea Essenwanger, Eugenia Rinaldi, Stefanie Rudolph, Kai U. Heitmann, Jörg J. Vehreschild, Christof von Kalle, Sylvia Thun
Format: Article
Language:English
Published: BMC 2020-12-01
Series:BMC Medical Informatics and Decision Making
Subjects:
Online Access:https://doi.org/10.1186/s12911-020-01374-w
id doaj-f9f0b674409d4987b8e239b7fe118b41
record_format Article
spelling doaj-f9f0b674409d4987b8e239b7fe118b412020-12-27T12:18:27ZengBMCBMC Medical Informatics and Decision Making1472-69472020-12-012011710.1186/s12911-020-01374-wThe German Corona Consensus Dataset (GECCO): a standardized dataset for COVID-19 research in university medicine and beyondJulian Sass0Alexander Bartschke1Moritz Lehne2Andrea Essenwanger3Eugenia Rinaldi4Stefanie Rudolph5Kai U. Heitmann6Jörg J. Vehreschild7Christof von Kalle8Sylvia Thun9Berlin Institute of Health (BIH)Charité – Universitätsmedizin BerlinBerlin Institute of Health (BIH)Berlin Institute of Health (BIH)Charité – Universitätsmedizin BerlinCharité – Universitätsmedizin BerlinHIH – Health Innovation Hub of the Federal Ministry of HealthMedical Department 2, Hematology/Oncology, University Hospital of FrankfurtBerlin Institute of Health (BIH)Berlin Institute of Health (BIH)Abstract Background The current COVID-19 pandemic has led to a surge of research activity. While this research provides important insights, the multitude of studies results in an increasing fragmentation of information. To ensure comparability across projects and institutions, standard datasets are needed. Here, we introduce the “German Corona Consensus Dataset” (GECCO), a uniform dataset that uses international terminologies and health IT standards to improve interoperability of COVID-19 data, in particular for university medicine. Methods Based on previous work (e.g., the ISARIC-WHO COVID-19 case report form) and in coordination with experts from university hospitals, professional associations and research initiatives, data elements relevant for COVID-19 research were collected, prioritized and consolidated into a compact core dataset. The dataset was mapped to international terminologies, and the Fast Healthcare Interoperability Resources (FHIR) standard was used to define interoperable, machine-readable data formats. Results A core dataset consisting of 81 data elements with 281 response options was defined, including information about, for example, demography, medical history, symptoms, therapy, medications or laboratory values of COVID-19 patients. Data elements and response options were mapped to SNOMED CT, LOINC, UCUM, ICD-10-GM and ATC, and FHIR profiles for interoperable data exchange were defined. Conclusion GECCO provides a compact, interoperable dataset that can help to make COVID-19 research data more comparable across studies and institutions. The dataset will be further refined in the future by adding domain-specific extension modules for more specialized use cases.https://doi.org/10.1186/s12911-020-01374-wCOVID-19InteroperabilityStandard datasetFHIR
collection DOAJ
language English
format Article
sources DOAJ
author Julian Sass
Alexander Bartschke
Moritz Lehne
Andrea Essenwanger
Eugenia Rinaldi
Stefanie Rudolph
Kai U. Heitmann
Jörg J. Vehreschild
Christof von Kalle
Sylvia Thun
spellingShingle Julian Sass
Alexander Bartschke
Moritz Lehne
Andrea Essenwanger
Eugenia Rinaldi
Stefanie Rudolph
Kai U. Heitmann
Jörg J. Vehreschild
Christof von Kalle
Sylvia Thun
The German Corona Consensus Dataset (GECCO): a standardized dataset for COVID-19 research in university medicine and beyond
BMC Medical Informatics and Decision Making
COVID-19
Interoperability
Standard dataset
FHIR
author_facet Julian Sass
Alexander Bartschke
Moritz Lehne
Andrea Essenwanger
Eugenia Rinaldi
Stefanie Rudolph
Kai U. Heitmann
Jörg J. Vehreschild
Christof von Kalle
Sylvia Thun
author_sort Julian Sass
title The German Corona Consensus Dataset (GECCO): a standardized dataset for COVID-19 research in university medicine and beyond
title_short The German Corona Consensus Dataset (GECCO): a standardized dataset for COVID-19 research in university medicine and beyond
title_full The German Corona Consensus Dataset (GECCO): a standardized dataset for COVID-19 research in university medicine and beyond
title_fullStr The German Corona Consensus Dataset (GECCO): a standardized dataset for COVID-19 research in university medicine and beyond
title_full_unstemmed The German Corona Consensus Dataset (GECCO): a standardized dataset for COVID-19 research in university medicine and beyond
title_sort german corona consensus dataset (gecco): a standardized dataset for covid-19 research in university medicine and beyond
publisher BMC
series BMC Medical Informatics and Decision Making
issn 1472-6947
publishDate 2020-12-01
description Abstract Background The current COVID-19 pandemic has led to a surge of research activity. While this research provides important insights, the multitude of studies results in an increasing fragmentation of information. To ensure comparability across projects and institutions, standard datasets are needed. Here, we introduce the “German Corona Consensus Dataset” (GECCO), a uniform dataset that uses international terminologies and health IT standards to improve interoperability of COVID-19 data, in particular for university medicine. Methods Based on previous work (e.g., the ISARIC-WHO COVID-19 case report form) and in coordination with experts from university hospitals, professional associations and research initiatives, data elements relevant for COVID-19 research were collected, prioritized and consolidated into a compact core dataset. The dataset was mapped to international terminologies, and the Fast Healthcare Interoperability Resources (FHIR) standard was used to define interoperable, machine-readable data formats. Results A core dataset consisting of 81 data elements with 281 response options was defined, including information about, for example, demography, medical history, symptoms, therapy, medications or laboratory values of COVID-19 patients. Data elements and response options were mapped to SNOMED CT, LOINC, UCUM, ICD-10-GM and ATC, and FHIR profiles for interoperable data exchange were defined. Conclusion GECCO provides a compact, interoperable dataset that can help to make COVID-19 research data more comparable across studies and institutions. The dataset will be further refined in the future by adding domain-specific extension modules for more specialized use cases.
topic COVID-19
Interoperability
Standard dataset
FHIR
url https://doi.org/10.1186/s12911-020-01374-w
work_keys_str_mv AT juliansass thegermancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT alexanderbartschke thegermancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT moritzlehne thegermancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT andreaessenwanger thegermancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT eugeniarinaldi thegermancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT stefanierudolph thegermancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT kaiuheitmann thegermancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT jorgjvehreschild thegermancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT christofvonkalle thegermancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT sylviathun thegermancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT juliansass germancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT alexanderbartschke germancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT moritzlehne germancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT andreaessenwanger germancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT eugeniarinaldi germancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT stefanierudolph germancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT kaiuheitmann germancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT jorgjvehreschild germancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT christofvonkalle germancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
AT sylviathun germancoronaconsensusdatasetgeccoastandardizeddatasetforcovid19researchinuniversitymedicineandbeyond
_version_ 1724369229410467840