A Profile Clustering Based Event Logs Repairing Approach for Process Mining

Process discovery, as the most crucial learning task in the process mining, builds some highly complex process models such as “spaghetti-like” from event logs contained large amounts of data. To enhance the process discovery method for all of flexible environments, many researc...

Full description

Bibliographic Details
Main Authors: Jiuyun Xu, Jie Liu
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8625568/
id doaj-ab228721279749fe855966aa9482fde9
record_format Article
spelling doaj-ab228721279749fe855966aa9482fde92021-03-29T22:24:28ZengIEEEIEEE Access2169-35362019-01-017178721788110.1109/ACCESS.2019.28949058625568A Profile Clustering Based Event Logs Repairing Approach for Process MiningJiuyun Xu0Jie Liu1https://orcid.org/0000-0003-1589-7191College of Computer and Communication Engineering, China University of Petroleum, Qingdao, ChinaCollege of Computer and Communication Engineering, China University of Petroleum, Qingdao, ChinaProcess discovery, as the most crucial learning task in the process mining, builds some highly complex process models such as “spaghetti-like” from event logs contained large amounts of data. To enhance the process discovery method for all of flexible environments, many researchers tried to exploit trace clustering approaches to split the logs into several homogeneous sub-logs, which are used to generate the corresponding sub-process model, respectively. However, their works are based on the assumption that the event logs are complete without missing any data values. On the contrary, the data in an event log may be lost due to some reasons such as system failure and human error. In this paper, we propose a method to deal with incomplete logs so as to discover the process model. First, we split up the event logs based on trace clustering. Then, the missing traces are assigned to the most similar clustering results, respectively. After that, with supplementing the missing data in the trace, a corresponding sub-process model is mined using the proposed method. At last, some experimental results on three real-life complex event logs demonstrate the feasibility and effectiveness of our method.https://ieeexplore.ieee.org/document/8625568/Process mininghighly flexible environmentstrace clusteringincomplete event logs
collection DOAJ
language English
format Article
sources DOAJ
author Jiuyun Xu
Jie Liu
spellingShingle Jiuyun Xu
Jie Liu
A Profile Clustering Based Event Logs Repairing Approach for Process Mining
IEEE Access
Process mining
highly flexible environments
trace clustering
incomplete event logs
author_facet Jiuyun Xu
Jie Liu
author_sort Jiuyun Xu
title A Profile Clustering Based Event Logs Repairing Approach for Process Mining
title_short A Profile Clustering Based Event Logs Repairing Approach for Process Mining
title_full A Profile Clustering Based Event Logs Repairing Approach for Process Mining
title_fullStr A Profile Clustering Based Event Logs Repairing Approach for Process Mining
title_full_unstemmed A Profile Clustering Based Event Logs Repairing Approach for Process Mining
title_sort profile clustering based event logs repairing approach for process mining
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Process discovery, as the most crucial learning task in the process mining, builds some highly complex process models such as “spaghetti-like” from event logs contained large amounts of data. To enhance the process discovery method for all of flexible environments, many researchers tried to exploit trace clustering approaches to split the logs into several homogeneous sub-logs, which are used to generate the corresponding sub-process model, respectively. However, their works are based on the assumption that the event logs are complete without missing any data values. On the contrary, the data in an event log may be lost due to some reasons such as system failure and human error. In this paper, we propose a method to deal with incomplete logs so as to discover the process model. First, we split up the event logs based on trace clustering. Then, the missing traces are assigned to the most similar clustering results, respectively. After that, with supplementing the missing data in the trace, a corresponding sub-process model is mined using the proposed method. At last, some experimental results on three real-life complex event logs demonstrate the feasibility and effectiveness of our method.
topic Process mining
highly flexible environments
trace clustering
incomplete event logs
url https://ieeexplore.ieee.org/document/8625568/
work_keys_str_mv AT jiuyunxu aprofileclusteringbasedeventlogsrepairingapproachforprocessmining
AT jieliu aprofileclusteringbasedeventlogsrepairingapproachforprocessmining
AT jiuyunxu profileclusteringbasedeventlogsrepairingapproachforprocessmining
AT jieliu profileclusteringbasedeventlogsrepairingapproachforprocessmining
_version_ 1724191674814431232