Application of Genetic Algorithms for Finding Edit Distance between Process Models
Finding graph-edit distance (graph similarity) is an important task in many computer science areas, such as image analysis, machine learning, chemicalinformatics. Recently, with the development of process mining techniques, it became important to adapt and apply existing graph analysis methods to ex...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Yaroslavl State University
2018-12-01
|
Series: | Modelirovanie i Analiz Informacionnyh Sistem |
Subjects: | |
Online Access: | https://www.mais-journal.ru/jour/article/view/768 |
id |
doaj-24428ee6740348958195e37c7ca25ffe |
---|---|
record_format |
Article |
spelling |
doaj-24428ee6740348958195e37c7ca25ffe2021-07-29T08:15:15ZengYaroslavl State UniversityModelirovanie i Analiz Informacionnyh Sistem1818-10152313-54172018-12-0125671172510.18255/1818-1015-2018-6-711-725536Application of Genetic Algorithms for Finding Edit Distance between Process ModelsAnna A. Kalenkova0Danil A. Kolesnikov1National Research University Higher School of EconomicsNational Research University Higher School of EconomicsFinding graph-edit distance (graph similarity) is an important task in many computer science areas, such as image analysis, machine learning, chemicalinformatics. Recently, with the development of process mining techniques, it became important to adapt and apply existing graph analysis methods to examine process models (annotated graphs) discovered from event data. In particular, finding graph-edit distance techniques can be used to reveal patterns (subprocesses), compare discovered process models. As it was shown experimentally and theoretically justified, exact methods for finding graph-edit distances between discovered process models (and graphs in general) are computationally expensive and can be applied to small models only. In this paper, we present and assess accuracy and performance characteristics of an inexact genetic algorithm applied to find distances between process models discovered from event logs. In particular, we find distances between BPMN (Business Process Model and Notation) models discovered from event logs by using different process discovery algorithms. We show that the genetic algorithm allows us to dramatically reduce the time of comparison and produces results which are close to the optimal solutions (minimal graph edit distances calculated by the exact search algorithm).https://www.mais-journal.ru/jour/article/view/768minimal graph edit distanceprocess miningbpmn (business process model and notation)genetic algorithm |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Anna A. Kalenkova Danil A. Kolesnikov |
spellingShingle |
Anna A. Kalenkova Danil A. Kolesnikov Application of Genetic Algorithms for Finding Edit Distance between Process Models Modelirovanie i Analiz Informacionnyh Sistem minimal graph edit distance process mining bpmn (business process model and notation) genetic algorithm |
author_facet |
Anna A. Kalenkova Danil A. Kolesnikov |
author_sort |
Anna A. Kalenkova |
title |
Application of Genetic Algorithms for Finding Edit Distance between Process Models |
title_short |
Application of Genetic Algorithms for Finding Edit Distance between Process Models |
title_full |
Application of Genetic Algorithms for Finding Edit Distance between Process Models |
title_fullStr |
Application of Genetic Algorithms for Finding Edit Distance between Process Models |
title_full_unstemmed |
Application of Genetic Algorithms for Finding Edit Distance between Process Models |
title_sort |
application of genetic algorithms for finding edit distance between process models |
publisher |
Yaroslavl State University |
series |
Modelirovanie i Analiz Informacionnyh Sistem |
issn |
1818-1015 2313-5417 |
publishDate |
2018-12-01 |
description |
Finding graph-edit distance (graph similarity) is an important task in many computer science areas, such as image analysis, machine learning, chemicalinformatics. Recently, with the development of process mining techniques, it became important to adapt and apply existing graph analysis methods to examine process models (annotated graphs) discovered from event data. In particular, finding graph-edit distance techniques can be used to reveal patterns (subprocesses), compare discovered process models. As it was shown experimentally and theoretically justified, exact methods for finding graph-edit distances between discovered process models (and graphs in general) are computationally expensive and can be applied to small models only. In this paper, we present and assess accuracy and performance characteristics of an inexact genetic algorithm applied to find distances between process models discovered from event logs. In particular, we find distances between BPMN (Business Process Model and Notation) models discovered from event logs by using different process discovery algorithms. We show that the genetic algorithm allows us to dramatically reduce the time of comparison and produces results which are close to the optimal solutions (minimal graph edit distances calculated by the exact search algorithm). |
topic |
minimal graph edit distance process mining bpmn (business process model and notation) genetic algorithm |
url |
https://www.mais-journal.ru/jour/article/view/768 |
work_keys_str_mv |
AT annaakalenkova applicationofgeneticalgorithmsforfindingeditdistancebetweenprocessmodels AT danilakolesnikov applicationofgeneticalgorithmsforfindingeditdistancebetweenprocessmodels |
_version_ |
1721256654548238336 |