Application of Genetic Algorithms for Finding Edit Distance between Process Models

Finding graph-edit distance (graph similarity) is an important task in many computer science areas, such as image analysis, machine learning, chemicalinformatics. Recently, with the development of process mining techniques, it became important to adapt and apply existing graph analysis methods to ex...

Full description

Bibliographic Details
Main Authors: Anna A. Kalenkova, Danil A. Kolesnikov
Format: Article
Language:English
Published: Yaroslavl State University 2018-12-01
Series:Modelirovanie i Analiz Informacionnyh Sistem
Subjects:
Online Access:https://www.mais-journal.ru/jour/article/view/768
id doaj-24428ee6740348958195e37c7ca25ffe
record_format Article
spelling doaj-24428ee6740348958195e37c7ca25ffe2021-07-29T08:15:15ZengYaroslavl State UniversityModelirovanie i Analiz Informacionnyh Sistem1818-10152313-54172018-12-0125671172510.18255/1818-1015-2018-6-711-725536Application of Genetic Algorithms for Finding Edit Distance between Process ModelsAnna A. Kalenkova0Danil A. Kolesnikov1National Research University Higher School of EconomicsNational Research University Higher School of EconomicsFinding graph-edit distance (graph similarity) is an important task in many computer science areas, such as image analysis, machine learning, chemicalinformatics. Recently, with the development of process mining techniques, it became important to adapt and apply existing graph analysis methods to examine process models (annotated graphs) discovered from event data. In particular, finding graph-edit distance techniques can be used to reveal patterns (subprocesses), compare discovered process models. As it was shown experimentally and theoretically justified, exact methods for finding graph-edit distances between discovered process models (and graphs in general) are computationally expensive and can be applied to small models only. In this paper, we present and assess accuracy and performance characteristics of an inexact genetic algorithm applied to find distances between process models discovered from event logs. In particular, we find distances between BPMN (Business Process Model and Notation) models discovered from event logs by using different process discovery algorithms. We show that the genetic algorithm allows us to dramatically reduce the time of comparison and produces results which are close to the optimal solutions (minimal graph edit distances calculated by the exact search algorithm).https://www.mais-journal.ru/jour/article/view/768minimal graph edit distanceprocess miningbpmn (business process model and notation)genetic algorithm
collection DOAJ
language English
format Article
sources DOAJ
author Anna A. Kalenkova
Danil A. Kolesnikov
spellingShingle Anna A. Kalenkova
Danil A. Kolesnikov
Application of Genetic Algorithms for Finding Edit Distance between Process Models
Modelirovanie i Analiz Informacionnyh Sistem
minimal graph edit distance
process mining
bpmn (business process model and notation)
genetic algorithm
author_facet Anna A. Kalenkova
Danil A. Kolesnikov
author_sort Anna A. Kalenkova
title Application of Genetic Algorithms for Finding Edit Distance between Process Models
title_short Application of Genetic Algorithms for Finding Edit Distance between Process Models
title_full Application of Genetic Algorithms for Finding Edit Distance between Process Models
title_fullStr Application of Genetic Algorithms for Finding Edit Distance between Process Models
title_full_unstemmed Application of Genetic Algorithms for Finding Edit Distance between Process Models
title_sort application of genetic algorithms for finding edit distance between process models
publisher Yaroslavl State University
series Modelirovanie i Analiz Informacionnyh Sistem
issn 1818-1015
2313-5417
publishDate 2018-12-01
description Finding graph-edit distance (graph similarity) is an important task in many computer science areas, such as image analysis, machine learning, chemicalinformatics. Recently, with the development of process mining techniques, it became important to adapt and apply existing graph analysis methods to examine process models (annotated graphs) discovered from event data. In particular, finding graph-edit distance techniques can be used to reveal patterns (subprocesses), compare discovered process models. As it was shown experimentally and theoretically justified, exact methods for finding graph-edit distances between discovered process models (and graphs in general) are computationally expensive and can be applied to small models only. In this paper, we present and assess accuracy and performance characteristics of an inexact genetic algorithm applied to find distances between process models discovered from event logs. In particular, we find distances between BPMN (Business Process Model and Notation) models discovered from event logs by using different process discovery algorithms. We show that the genetic algorithm allows us to dramatically reduce the time of comparison and produces results which are close to the optimal solutions (minimal graph edit distances calculated by the exact search algorithm).
topic minimal graph edit distance
process mining
bpmn (business process model and notation)
genetic algorithm
url https://www.mais-journal.ru/jour/article/view/768
work_keys_str_mv AT annaakalenkova applicationofgeneticalgorithmsforfindingeditdistancebetweenprocessmodels
AT danilakolesnikov applicationofgeneticalgorithmsforfindingeditdistancebetweenprocessmodels
_version_ 1721256654548238336