Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments

Abstract Background Emerging pattern mining is a data mining task that extracts rules describing discriminative relationships amongst variables. These rules should be understandable for the experts. Comprehensibility of a rule is traditionally determined by several objectives, which can be calculate...

Full description

Bibliographic Details
Main Authors: Ángel Miguel García-Vico, Pedro González, Cristóbal José Carmona, María José del Jesus
Format: Article
Language:English
Published: BMC 2019-01-01
Series:Big Data Analytics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s41044-018-0038-8
id doaj-8b9dbc42170f4d2ca470a82f79d61d72
record_format Article
spelling doaj-8b9dbc42170f4d2ca470a82f79d61d722020-11-25T01:41:45ZengBMCBig Data Analytics2058-63452019-01-014111510.1186/s41044-018-0038-8Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environmentsÁngel Miguel García-Vico0Pedro González1Cristóbal José Carmona2María José del Jesus3Department of Computer Science, University of JaénDepartment of Computer Science, University of JaénDepartment of Computer Science, University of JaénDepartment of Computer Science, University of JaénAbstract Background Emerging pattern mining is a data mining task that extracts rules describing discriminative relationships amongst variables. These rules should be understandable for the experts. Comprehensibility of a rule is traditionally determined by several objectives, which can be calculated by different measures. In this way, multi-objective evolutionary algorithms are suitable for this task. Currently, the growing amount of data makes traditional data mining tasks unable to process them in a reasonable time. These huge amounts of data make even more interesting the extraction of rules that can easily describe the underlying phenomena of this big data. So far there is only one algorithm for emerging pattern mining developed based on multi-objective evolutionary algorithms for big data, the BD-EFEP algorithm. The influence of the selection of different quality measures as objectives in the search process is analysed in this paper. Results The results show that the use of the combination based on Jaccard index and false positive rate is the one with the best trade-off for descriptive induction of emerging patterns. Conclusions It is recommended the use of this combination of quality measure as optimisation objectives in future multi-objective evolutionary algorithm developments for emerging pattern mining focused in big data.http://link.springer.com/article/10.1186/s41044-018-0038-8Evolutionary algorithmsFuzzy systemsBig dataEmerging pattern mining
collection DOAJ
language English
format Article
sources DOAJ
author Ángel Miguel García-Vico
Pedro González
Cristóbal José Carmona
María José del Jesus
spellingShingle Ángel Miguel García-Vico
Pedro González
Cristóbal José Carmona
María José del Jesus
Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments
Big Data Analytics
Evolutionary algorithms
Fuzzy systems
Big data
Emerging pattern mining
author_facet Ángel Miguel García-Vico
Pedro González
Cristóbal José Carmona
María José del Jesus
author_sort Ángel Miguel García-Vico
title Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments
title_short Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments
title_full Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments
title_fullStr Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments
title_full_unstemmed Study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments
title_sort study on the use of different quality measures within a multi-objective evolutionary algorithm approach for emerging pattern mining in big data environments
publisher BMC
series Big Data Analytics
issn 2058-6345
publishDate 2019-01-01
description Abstract Background Emerging pattern mining is a data mining task that extracts rules describing discriminative relationships amongst variables. These rules should be understandable for the experts. Comprehensibility of a rule is traditionally determined by several objectives, which can be calculated by different measures. In this way, multi-objective evolutionary algorithms are suitable for this task. Currently, the growing amount of data makes traditional data mining tasks unable to process them in a reasonable time. These huge amounts of data make even more interesting the extraction of rules that can easily describe the underlying phenomena of this big data. So far there is only one algorithm for emerging pattern mining developed based on multi-objective evolutionary algorithms for big data, the BD-EFEP algorithm. The influence of the selection of different quality measures as objectives in the search process is analysed in this paper. Results The results show that the use of the combination based on Jaccard index and false positive rate is the one with the best trade-off for descriptive induction of emerging patterns. Conclusions It is recommended the use of this combination of quality measure as optimisation objectives in future multi-objective evolutionary algorithm developments for emerging pattern mining focused in big data.
topic Evolutionary algorithms
Fuzzy systems
Big data
Emerging pattern mining
url http://link.springer.com/article/10.1186/s41044-018-0038-8
work_keys_str_mv AT angelmiguelgarciavico studyontheuseofdifferentqualitymeasureswithinamultiobjectiveevolutionaryalgorithmapproachforemergingpatternmininginbigdataenvironments
AT pedrogonzalez studyontheuseofdifferentqualitymeasureswithinamultiobjectiveevolutionaryalgorithmapproachforemergingpatternmininginbigdataenvironments
AT cristobaljosecarmona studyontheuseofdifferentqualitymeasureswithinamultiobjectiveevolutionaryalgorithmapproachforemergingpatternmininginbigdataenvironments
AT mariajosedeljesus studyontheuseofdifferentqualitymeasureswithinamultiobjectiveevolutionaryalgorithmapproachforemergingpatternmininginbigdataenvironments
_version_ 1725039871309905920