Automatic Process Comparison for Subpopulations: Application in Cancer Care
Processes in organisations, such as hospitals, may deviate from the intended standard processes, due to unforeseeable events and the complexity of the organisation. For hospitals, the knowledge of actual patient streams for patient populations (e.g., severe or non-severe cases) is important for qual...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-08-01
|
Series: | International Journal of Environmental Research and Public Health |
Subjects: | |
Online Access: | https://www.mdpi.com/1660-4601/17/16/5707 |
id |
doaj-c1a1fbd60649494e9ece54b5c171c56d |
---|---|
record_format |
Article |
spelling |
doaj-c1a1fbd60649494e9ece54b5c171c56d2020-11-25T03:03:31ZengMDPI AGInternational Journal of Environmental Research and Public Health1661-78271660-46012020-08-01175707570710.3390/ijerph17165707Automatic Process Comparison for Subpopulations: Application in Cancer CareFrancesca Marazza0Faiza Allah Bukhsh1Jeroen Geerdink2Onno Vijlbrief3Shreyasi Pathak4Maurice van Keulen5and Christin Seifert6Faculty of Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, 7522 NB Enschede, The Netherlands; f.a.bukhsh (F.A.B.); s.pathak@utwente.nl (S.P.); m.vankeulen@utwente.nl (M.v.K.)Faculty of Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, 7522 NB Enschede, The Netherlands; f.a.bukhsh (F.A.B.); s.pathak@utwente.nl (S.P.); m.vankeulen@utwente.nl (M.v.K.)Hospital Group Twente (ZGT), 7555 DL Hengelo, The Netherlands; j.geerdink@zgt.nl (J.G.); o.vijlbrief@zgt.nl (O.V.)Hospital Group Twente (ZGT), 7555 DL Hengelo, The Netherlands; j.geerdink@zgt.nl (J.G.); o.vijlbrief@zgt.nl (O.V.)Faculty of Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, 7522 NB Enschede, The Netherlands; f.a.bukhsh (F.A.B.); s.pathak@utwente.nl (S.P.); m.vankeulen@utwente.nl (M.v.K.)Faculty of Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, 7522 NB Enschede, The Netherlands; f.a.bukhsh (F.A.B.); s.pathak@utwente.nl (S.P.); m.vankeulen@utwente.nl (M.v.K.)Faculty of Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, 7522 NB Enschede, The Netherlands; f.a.bukhsh (F.A.B.); s.pathak@utwente.nl (S.P.); m.vankeulen@utwente.nl (M.v.K.)Processes in organisations, such as hospitals, may deviate from the intended standard processes, due to unforeseeable events and the complexity of the organisation. For hospitals, the knowledge of actual patient streams for patient populations (e.g., severe or non-severe cases) is important for quality control and improvement. Process discovery from event data in electronic health records can shed light on the patient flows, but their comparison for different populations is cumbersome and time-consuming. In this paper, we present an approach for the automatic comparison of process models that were extracted from events in electronic health records. Concretely, we propose comparing processes for different patient populations by cross-log conformance checking, and standard graph similarity measures obtained from the directed graph underlying the process model. We perform a user study with 20 participants in order to obtain a ground truth for similarity of process models. We evaluate our approach on two data sets, the publicly available MIMIC database with the focus on different cancer patients in intensive care, and a database on breast cancer patients from a Dutch hospital. In our experiments, we found average fitness to be a good indicator for visual similarity in the ZGT use case, while the average precision and graph edit distance are strongly correlated with visual impression for cancer process models on MIMIC. These results are a call for further research and evaluation for determining which similarity or combination of similarities is needed in which type of process model comparison.https://www.mdpi.com/1660-4601/17/16/5707process miningprocess comparisonquality controlcancer typesbreast cancer careMIMIC database |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Francesca Marazza Faiza Allah Bukhsh Jeroen Geerdink Onno Vijlbrief Shreyasi Pathak Maurice van Keulen and Christin Seifert |
spellingShingle |
Francesca Marazza Faiza Allah Bukhsh Jeroen Geerdink Onno Vijlbrief Shreyasi Pathak Maurice van Keulen and Christin Seifert Automatic Process Comparison for Subpopulations: Application in Cancer Care International Journal of Environmental Research and Public Health process mining process comparison quality control cancer types breast cancer care MIMIC database |
author_facet |
Francesca Marazza Faiza Allah Bukhsh Jeroen Geerdink Onno Vijlbrief Shreyasi Pathak Maurice van Keulen and Christin Seifert |
author_sort |
Francesca Marazza |
title |
Automatic Process Comparison for Subpopulations: Application in Cancer Care |
title_short |
Automatic Process Comparison for Subpopulations: Application in Cancer Care |
title_full |
Automatic Process Comparison for Subpopulations: Application in Cancer Care |
title_fullStr |
Automatic Process Comparison for Subpopulations: Application in Cancer Care |
title_full_unstemmed |
Automatic Process Comparison for Subpopulations: Application in Cancer Care |
title_sort |
automatic process comparison for subpopulations: application in cancer care |
publisher |
MDPI AG |
series |
International Journal of Environmental Research and Public Health |
issn |
1661-7827 1660-4601 |
publishDate |
2020-08-01 |
description |
Processes in organisations, such as hospitals, may deviate from the intended standard processes, due to unforeseeable events and the complexity of the organisation. For hospitals, the knowledge of actual patient streams for patient populations (e.g., severe or non-severe cases) is important for quality control and improvement. Process discovery from event data in electronic health records can shed light on the patient flows, but their comparison for different populations is cumbersome and time-consuming. In this paper, we present an approach for the automatic comparison of process models that were extracted from events in electronic health records. Concretely, we propose comparing processes for different patient populations by cross-log conformance checking, and standard graph similarity measures obtained from the directed graph underlying the process model. We perform a user study with 20 participants in order to obtain a ground truth for similarity of process models. We evaluate our approach on two data sets, the publicly available MIMIC database with the focus on different cancer patients in intensive care, and a database on breast cancer patients from a Dutch hospital. In our experiments, we found average fitness to be a good indicator for visual similarity in the ZGT use case, while the average precision and graph edit distance are strongly correlated with visual impression for cancer process models on MIMIC. These results are a call for further research and evaluation for determining which similarity or combination of similarities is needed in which type of process model comparison. |
topic |
process mining process comparison quality control cancer types breast cancer care MIMIC database |
url |
https://www.mdpi.com/1660-4601/17/16/5707 |
work_keys_str_mv |
AT francescamarazza automaticprocesscomparisonforsubpopulationsapplicationincancercare AT faizaallahbukhsh automaticprocesscomparisonforsubpopulationsapplicationincancercare AT jeroengeerdink automaticprocesscomparisonforsubpopulationsapplicationincancercare AT onnovijlbrief automaticprocesscomparisonforsubpopulationsapplicationincancercare AT shreyasipathak automaticprocesscomparisonforsubpopulationsapplicationincancercare AT mauricevankeulen automaticprocesscomparisonforsubpopulationsapplicationincancercare AT andchristinseifert automaticprocesscomparisonforsubpopulationsapplicationincancercare |
_version_ |
1724685329558929408 |