Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments
Abstract Background Emerging pattern mining is a data mining task that extracts rules describing discriminative relationships amongst variables. These rules should be understandable for the experts. Comprehensibility of a rule is traditionally determined by several objectives, which can be calculate...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2019-01-01
|
Series: | Big Data Analytics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s41044-018-0038-8 |
id |
doaj-8b9dbc42170f4d2ca470a82f79d61d72 |
---|---|
record_format |
Article |
spelling |
doaj-8b9dbc42170f4d2ca470a82f79d61d722020-11-25T01:41:45ZengBMCBig Data Analytics2058-63452019-01-014111510.1186/s41044-018-0038-8Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environmentsÁngel Miguel García-Vico0Pedro González1Cristóbal José Carmona2María José del Jesus3Department of Computer Science, University of JaénDepartment of Computer Science, University of JaénDepartment of Computer Science, University of JaénDepartment of Computer Science, University of JaénAbstract Background Emerging pattern mining is a data mining task that extracts rules describing discriminative relationships amongst variables. These rules should be understandable for the experts. Comprehensibility of a rule is traditionally determined by several objectives, which can be calculated by different measures. In this way, multi-objective evolutionary algorithms are suitable for this task. Currently, the growing amount of data makes traditional data mining tasks unable to process them in a reasonable time. These huge amounts of data make even more interesting the extraction of rules that can easily describe the underlying phenomena of this big data. So far there is only one algorithm for emerging pattern mining developed based on multi-objective evolutionary algorithms for big data, the BD-EFEP algorithm. The influence of the selection of different quality measures as objectives in the search process is analysed in this paper. Results The results show that the use of the combination based on Jaccard index and false positive rate is the one with the best trade-off for descriptive induction of emerging patterns. Conclusions It is recommended the use of this combination of quality measure as optimisation objectives in future multi-objective evolutionary algorithm developments for emerging pattern mining focused in big data.http://link.springer.com/article/10.1186/s41044-018-0038-8Evolutionary algorithmsFuzzy systemsBig dataEmerging pattern mining |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ángel Miguel García-Vico Pedro González Cristóbal José Carmona María José del Jesus |
spellingShingle |
Ángel Miguel García-Vico Pedro González Cristóbal José Carmona María José del Jesus Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments Big Data Analytics Evolutionary algorithms Fuzzy systems Big data Emerging pattern mining |
author_facet |
Ángel Miguel García-Vico Pedro González Cristóbal José Carmona María José del Jesus |
author_sort |
Ángel Miguel García-Vico |
title |
Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments |
title_short |
Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments |
title_full |
Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments |
title_fullStr |
Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments |
title_full_unstemmed |
Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments |
title_sort |
study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments |
publisher |
BMC |
series |
Big Data Analytics |
issn |
2058-6345 |
publishDate |
2019-01-01 |
description |
Abstract Background Emerging pattern mining is a data mining task that extracts rules describing discriminative relationships amongst variables. These rules should be understandable for the experts. Comprehensibility of a rule is traditionally determined by several objectives, which can be calculated by different measures. In this way, multi-objective evolutionary algorithms are suitable for this task. Currently, the growing amount of data makes traditional data mining tasks unable to process them in a reasonable time. These huge amounts of data make even more interesting the extraction of rules that can easily describe the underlying phenomena of this big data. So far there is only one algorithm for emerging pattern mining developed based on multi-objective evolutionary algorithms for big data, the BD-EFEP algorithm. The influence of the selection of different quality measures as objectives in the search process is analysed in this paper. Results The results show that the use of the combination based on Jaccard index and false positive rate is the one with the best trade-off for descriptive induction of emerging patterns. Conclusions It is recommended the use of this combination of quality measure as optimisation objectives in future multi-objective evolutionary algorithm developments for emerging pattern mining focused in big data. |
topic |
Evolutionary algorithms Fuzzy systems Big data Emerging pattern mining |
url |
http://link.springer.com/article/10.1186/s41044-018-0038-8 |
work_keys_str_mv |
AT angelmiguelgarciavico studyontheuseofdifferentqualitymeasureswithinamultiobjectiveevolutionaryalgorithmapproachforemergingpatternmininginbigdataenvironments AT pedrogonzalez studyontheuseofdifferentqualitymeasureswithinamultiobjectiveevolutionaryalgorithmapproachforemergingpatternmininginbigdataenvironments AT cristobaljosecarmona studyontheuseofdifferentqualitymeasureswithinamultiobjectiveevolutionaryalgorithmapproachforemergingpatternmininginbigdataenvironments AT mariajosedeljesus studyontheuseofdifferentqualitymeasureswithinamultiobjectiveevolutionaryalgorithmapproachforemergingpatternmininginbigdataenvironments |
_version_ |
1725039871309905920 |