Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization

DNA sequence mining is essential in the study of the structure and function of the DNA sequence. A few exploration works have been published in the literature concerning sequence mining in information mining task. Similarly, in our past paper, an effective sequence mining was performed on a DNA data...

Full description

Bibliographic Details
Main Authors: Lakshmanna Kuruva, Khare Neelu
Format: Article
Language:English
Published: De Gruyter 2018-07-01
Series:Journal of Intelligent Systems
Subjects:
Online Access:https://doi.org/10.1515/jisys-2016-0111
id doaj-f59cc9e5a98343f1a528a51e407c8556
record_format Article
spelling doaj-f59cc9e5a98343f1a528a51e407c85562021-09-06T19:40:37ZengDe GruyterJournal of Intelligent Systems0334-18602191-026X2018-07-0127334936210.1515/jisys-2016-0111Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search OptimizationLakshmanna Kuruva0Khare Neelu1VIT University, Vellore, Tamil Nadu 632014, IndiaVIT University, Vellore, Tamil Nadu 632014, IndiaDNA sequence mining is essential in the study of the structure and function of the DNA sequence. A few exploration works have been published in the literature concerning sequence mining in information mining task. Similarly, in our past paper, an effective sequence mining was performed on a DNA database utilizing constraint measures and group search optimization (GSO). In that study, GSO calculation was utilized to optimize the sequence extraction process from a given DNA database. However, it is apparent that, occasionally, such an arbitrary seeking system does not accompany the optimal solution in the given time. To overcome the problem, we proposed in this work multiple constraints with hybrid firefly and GSO (HFGSO) algorithm. The complete DNA sequence mining process comprised the following three modules: (i) applying prefix span algorithm; (ii) calculating the length, width, and regular expression (RE) constraints; and (iii) optimal mining via HFGSO. First, we apply the concept of prefix span, which detects the frequent DNA sequence pattern using a prefix tree. Based on this prefix tree, length, width, and RE constraints are applied to handle restrictions. Finally, we adopt the HFGSO algorithm for the completeness of the mining result. The experimentation is carried out on the standard DNA sequence dataset, and the evaluation with DNA sequence dataset and the results show that our approach is better than the existing approach.https://doi.org/10.1515/jisys-2016-0111prefix spanconstraintsregular expressiondna sequencemininghfgsoweightlength
collection DOAJ
language English
format Article
sources DOAJ
author Lakshmanna Kuruva
Khare Neelu
spellingShingle Lakshmanna Kuruva
Khare Neelu
Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization
Journal of Intelligent Systems
prefix span
constraints
regular expression
dna sequence
mining
hfgso
weight
length
author_facet Lakshmanna Kuruva
Khare Neelu
author_sort Lakshmanna Kuruva
title Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization
title_short Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization
title_full Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization
title_fullStr Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization
title_full_unstemmed Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization
title_sort mining dna sequence patterns with constraints using hybridization of firefly and group search optimization
publisher De Gruyter
series Journal of Intelligent Systems
issn 0334-1860
2191-026X
publishDate 2018-07-01
description DNA sequence mining is essential in the study of the structure and function of the DNA sequence. A few exploration works have been published in the literature concerning sequence mining in information mining task. Similarly, in our past paper, an effective sequence mining was performed on a DNA database utilizing constraint measures and group search optimization (GSO). In that study, GSO calculation was utilized to optimize the sequence extraction process from a given DNA database. However, it is apparent that, occasionally, such an arbitrary seeking system does not accompany the optimal solution in the given time. To overcome the problem, we proposed in this work multiple constraints with hybrid firefly and GSO (HFGSO) algorithm. The complete DNA sequence mining process comprised the following three modules: (i) applying prefix span algorithm; (ii) calculating the length, width, and regular expression (RE) constraints; and (iii) optimal mining via HFGSO. First, we apply the concept of prefix span, which detects the frequent DNA sequence pattern using a prefix tree. Based on this prefix tree, length, width, and RE constraints are applied to handle restrictions. Finally, we adopt the HFGSO algorithm for the completeness of the mining result. The experimentation is carried out on the standard DNA sequence dataset, and the evaluation with DNA sequence dataset and the results show that our approach is better than the existing approach.
topic prefix span
constraints
regular expression
dna sequence
mining
hfgso
weight
length
url https://doi.org/10.1515/jisys-2016-0111
work_keys_str_mv AT lakshmannakuruva miningdnasequencepatternswithconstraintsusinghybridizationoffireflyandgroupsearchoptimization
AT khareneelu miningdnasequencepatternswithconstraintsusinghybridizationoffireflyandgroupsearchoptimization
_version_ 1717768082570608640