Querying archetype-based EHRs by search ontology-based XPath engineering

Abstract Background Legacy data and new structured data can be stored in a standardized format as XML-based EHRs on XML databases. Querying documents on these databases is crucial for answering research questions. Instead of using free text searches, that lead to false positive results, the precisio...

Full description

Bibliographic Details
Main Authors: Stefan Kropf, Alexandr Uciteli, Katrin Schierle, Peter Krücken, Kerstin Denecke, Heinrich Herre
Format: Article
Language:English
Published: BMC 2018-05-01
Series:Journal of Biomedical Semantics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s13326-018-0180-2
id doaj-0c5f9dc6640d4c2d8ffed15d53fe8669
record_format Article
spelling doaj-0c5f9dc6640d4c2d8ffed15d53fe86692020-11-24T21:39:05ZengBMCJournal of Biomedical Semantics2041-14802018-05-019111410.1186/s13326-018-0180-2Querying archetype-based EHRs by search ontology-based XPath engineeringStefan Kropf0Alexandr Uciteli1Katrin Schierle2Peter Krücken3Kerstin Denecke4Heinrich Herre5Institute for Medical Informatics, Statistics and Epidemiology (IMISE), Leipzig UniversityInstitute for Medical Informatics, Statistics and Epidemiology (IMISE), Leipzig UniversityInstitute of Pathology, Leipzig University HospitalInstitute of Pathology, Leipzig University HospitalInstitute for Medical Informatics, Bern University of Applied ScienceInstitute for Medical Informatics, Statistics and Epidemiology (IMISE), Leipzig UniversityAbstract Background Legacy data and new structured data can be stored in a standardized format as XML-based EHRs on XML databases. Querying documents on these databases is crucial for answering research questions. Instead of using free text searches, that lead to false positive results, the precision can be increased by constraining the search to certain parts of documents. Methods A search ontology-based specification of queries on XML documents defines search concepts and relates them to parts in the XML document structure. Such query specification method is practically introduced and evaluated by applying concrete research questions formulated in natural language on a data collection for information retrieval purposes. The search is performed by search ontology-based XPath engineering that reuses ontologies and XML-related W3C standards. Results The key result is that the specification of research questions can be supported by the usage of search ontology-based XPath engineering. A deeper recognition of entities and a semantic understanding of the content is necessary for a further improvement of precision and recall. Key limitation is that the application of the introduced process requires skills in ontology and software development. In future, the time consuming ontology development could be overcome by implementing a new clinical role: the clinical ontologist. Conclusion The introduced Search Ontology XML extension connects Search Terms to certain parts in XML documents and enables an ontology-based definition of queries. Search ontology-based XPath engineering can support research question answering by the specification of complex XPath expressions without deep syntax knowledge about XPaths.http://link.springer.com/article/10.1186/s13326-018-0180-2Electronic health recordsMedical informatics applicationsSearch ontologyInformation retrievalEHR queryPathology electronic health records
collection DOAJ
language English
format Article
sources DOAJ
author Stefan Kropf
Alexandr Uciteli
Katrin Schierle
Peter Krücken
Kerstin Denecke
Heinrich Herre
spellingShingle Stefan Kropf
Alexandr Uciteli
Katrin Schierle
Peter Krücken
Kerstin Denecke
Heinrich Herre
Querying archetype-based EHRs by search ontology-based XPath engineering
Journal of Biomedical Semantics
Electronic health records
Medical informatics applications
Search ontology
Information retrieval
EHR query
Pathology electronic health records
author_facet Stefan Kropf
Alexandr Uciteli
Katrin Schierle
Peter Krücken
Kerstin Denecke
Heinrich Herre
author_sort Stefan Kropf
title Querying archetype-based EHRs by search ontology-based XPath engineering
title_short Querying archetype-based EHRs by search ontology-based XPath engineering
title_full Querying archetype-based EHRs by search ontology-based XPath engineering
title_fullStr Querying archetype-based EHRs by search ontology-based XPath engineering
title_full_unstemmed Querying archetype-based EHRs by search ontology-based XPath engineering
title_sort querying archetype-based ehrs by search ontology-based xpath engineering
publisher BMC
series Journal of Biomedical Semantics
issn 2041-1480
publishDate 2018-05-01
description Abstract Background Legacy data and new structured data can be stored in a standardized format as XML-based EHRs on XML databases. Querying documents on these databases is crucial for answering research questions. Instead of using free text searches, that lead to false positive results, the precision can be increased by constraining the search to certain parts of documents. Methods A search ontology-based specification of queries on XML documents defines search concepts and relates them to parts in the XML document structure. Such query specification method is practically introduced and evaluated by applying concrete research questions formulated in natural language on a data collection for information retrieval purposes. The search is performed by search ontology-based XPath engineering that reuses ontologies and XML-related W3C standards. Results The key result is that the specification of research questions can be supported by the usage of search ontology-based XPath engineering. A deeper recognition of entities and a semantic understanding of the content is necessary for a further improvement of precision and recall. Key limitation is that the application of the introduced process requires skills in ontology and software development. In future, the time consuming ontology development could be overcome by implementing a new clinical role: the clinical ontologist. Conclusion The introduced Search Ontology XML extension connects Search Terms to certain parts in XML documents and enables an ontology-based definition of queries. Search ontology-based XPath engineering can support research question answering by the specification of complex XPath expressions without deep syntax knowledge about XPaths.
topic Electronic health records
Medical informatics applications
Search ontology
Information retrieval
EHR query
Pathology electronic health records
url http://link.springer.com/article/10.1186/s13326-018-0180-2
work_keys_str_mv AT stefankropf queryingarchetypebasedehrsbysearchontologybasedxpathengineering
AT alexandruciteli queryingarchetypebasedehrsbysearchontologybasedxpathengineering
AT katrinschierle queryingarchetypebasedehrsbysearchontologybasedxpathengineering
AT peterkrucken queryingarchetypebasedehrsbysearchontologybasedxpathengineering
AT kerstindenecke queryingarchetypebasedehrsbysearchontologybasedxpathengineering
AT heinrichherre queryingarchetypebasedehrsbysearchontologybasedxpathengineering
_version_ 1725932756083408896