Automatic Process Comparison for Subpopulations: Application in Cancer Care

Processes in organisations, such as hospitals, may deviate from the intended standard processes, due to unforeseeable events and the complexity of the organisation. For hospitals, the knowledge of actual patient streams for patient populations (e.g., severe or non-severe cases) is important for qual...

Full description

Bibliographic Details
Main Authors: Francesca Marazza, Faiza Allah Bukhsh, Jeroen Geerdink, Onno Vijlbrief, Shreyasi Pathak, Maurice van Keulen, and Christin Seifert
Format: Article
Language:English
Published: MDPI AG 2020-08-01
Series:International Journal of Environmental Research and Public Health
Subjects:
Online Access:https://www.mdpi.com/1660-4601/17/16/5707
id doaj-c1a1fbd60649494e9ece54b5c171c56d
record_format Article
spelling doaj-c1a1fbd60649494e9ece54b5c171c56d2020-11-25T03:03:31ZengMDPI AGInternational Journal of Environmental Research and Public Health1661-78271660-46012020-08-01175707570710.3390/ijerph17165707Automatic Process Comparison for Subpopulations: Application in Cancer CareFrancesca Marazza0Faiza Allah Bukhsh1Jeroen Geerdink2Onno Vijlbrief3Shreyasi Pathak4Maurice van Keulen5and Christin Seifert6Faculty of Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, 7522 NB Enschede, The Netherlands; f.a.bukhsh (F.A.B.); s.pathak@utwente.nl (S.P.); m.vankeulen@utwente.nl (M.v.K.)Faculty of Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, 7522 NB Enschede, The Netherlands; f.a.bukhsh (F.A.B.); s.pathak@utwente.nl (S.P.); m.vankeulen@utwente.nl (M.v.K.)Hospital Group Twente (ZGT), 7555 DL Hengelo, The Netherlands; j.geerdink@zgt.nl (J.G.); o.vijlbrief@zgt.nl (O.V.)Hospital Group Twente (ZGT), 7555 DL Hengelo, The Netherlands; j.geerdink@zgt.nl (J.G.); o.vijlbrief@zgt.nl (O.V.)Faculty of Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, 7522 NB Enschede, The Netherlands; f.a.bukhsh (F.A.B.); s.pathak@utwente.nl (S.P.); m.vankeulen@utwente.nl (M.v.K.)Faculty of Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, 7522 NB Enschede, The Netherlands; f.a.bukhsh (F.A.B.); s.pathak@utwente.nl (S.P.); m.vankeulen@utwente.nl (M.v.K.)Faculty of Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, 7522 NB Enschede, The Netherlands; f.a.bukhsh (F.A.B.); s.pathak@utwente.nl (S.P.); m.vankeulen@utwente.nl (M.v.K.)Processes in organisations, such as hospitals, may deviate from the intended standard processes, due to unforeseeable events and the complexity of the organisation. For hospitals, the knowledge of actual patient streams for patient populations (e.g., severe or non-severe cases) is important for quality control and improvement. Process discovery from event data in electronic health records can shed light on the patient flows, but their comparison for different populations is cumbersome and time-consuming. In this paper, we present an approach for the automatic comparison of process models that were extracted from events in electronic health records. Concretely, we propose comparing processes for different patient populations by cross-log conformance checking, and standard graph similarity measures obtained from the directed graph underlying the process model. We perform a user study with 20 participants in order to obtain a ground truth for similarity of process models. We evaluate our approach on two data sets, the publicly available MIMIC database with the focus on different cancer patients in intensive care, and a database on breast cancer patients from a Dutch hospital. In our experiments, we found average fitness to be a good indicator for visual similarity in the ZGT use case, while the average precision and graph edit distance are strongly correlated with visual impression for cancer process models on MIMIC. These results are a call for further research and evaluation for determining which similarity or combination of similarities is needed in which type of process model comparison.https://www.mdpi.com/1660-4601/17/16/5707process miningprocess comparisonquality controlcancer typesbreast cancer careMIMIC database
collection DOAJ
language English
format Article
sources DOAJ
author Francesca Marazza
Faiza Allah Bukhsh
Jeroen Geerdink
Onno Vijlbrief
Shreyasi Pathak
Maurice van Keulen
and Christin Seifert
spellingShingle Francesca Marazza
Faiza Allah Bukhsh
Jeroen Geerdink
Onno Vijlbrief
Shreyasi Pathak
Maurice van Keulen
and Christin Seifert
Automatic Process Comparison for Subpopulations: Application in Cancer Care
International Journal of Environmental Research and Public Health
process mining
process comparison
quality control
cancer types
breast cancer care
MIMIC database
author_facet Francesca Marazza
Faiza Allah Bukhsh
Jeroen Geerdink
Onno Vijlbrief
Shreyasi Pathak
Maurice van Keulen
and Christin Seifert
author_sort Francesca Marazza
title Automatic Process Comparison for Subpopulations: Application in Cancer Care
title_short Automatic Process Comparison for Subpopulations: Application in Cancer Care
title_full Automatic Process Comparison for Subpopulations: Application in Cancer Care
title_fullStr Automatic Process Comparison for Subpopulations: Application in Cancer Care
title_full_unstemmed Automatic Process Comparison for Subpopulations: Application in Cancer Care
title_sort automatic process comparison for subpopulations: application in cancer care
publisher MDPI AG
series International Journal of Environmental Research and Public Health
issn 1661-7827
1660-4601
publishDate 2020-08-01
description Processes in organisations, such as hospitals, may deviate from the intended standard processes, due to unforeseeable events and the complexity of the organisation. For hospitals, the knowledge of actual patient streams for patient populations (e.g., severe or non-severe cases) is important for quality control and improvement. Process discovery from event data in electronic health records can shed light on the patient flows, but their comparison for different populations is cumbersome and time-consuming. In this paper, we present an approach for the automatic comparison of process models that were extracted from events in electronic health records. Concretely, we propose comparing processes for different patient populations by cross-log conformance checking, and standard graph similarity measures obtained from the directed graph underlying the process model. We perform a user study with 20 participants in order to obtain a ground truth for similarity of process models. We evaluate our approach on two data sets, the publicly available MIMIC database with the focus on different cancer patients in intensive care, and a database on breast cancer patients from a Dutch hospital. In our experiments, we found average fitness to be a good indicator for visual similarity in the ZGT use case, while the average precision and graph edit distance are strongly correlated with visual impression for cancer process models on MIMIC. These results are a call for further research and evaluation for determining which similarity or combination of similarities is needed in which type of process model comparison.
topic process mining
process comparison
quality control
cancer types
breast cancer care
MIMIC database
url https://www.mdpi.com/1660-4601/17/16/5707
work_keys_str_mv AT francescamarazza automaticprocesscomparisonforsubpopulationsapplicationincancercare
AT faizaallahbukhsh automaticprocesscomparisonforsubpopulationsapplicationincancercare
AT jeroengeerdink automaticprocesscomparisonforsubpopulationsapplicationincancercare
AT onnovijlbrief automaticprocesscomparisonforsubpopulationsapplicationincancercare
AT shreyasipathak automaticprocesscomparisonforsubpopulationsapplicationincancercare
AT mauricevankeulen automaticprocesscomparisonforsubpopulationsapplicationincancercare
AT andchristinseifert automaticprocesscomparisonforsubpopulationsapplicationincancercare
_version_ 1724685329558929408