Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization
DNA sequence mining is essential in the study of the structure and function of the DNA sequence. A few exploration works have been published in the literature concerning sequence mining in information mining task. Similarly, in our past paper, an effective sequence mining was performed on a DNA data...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
De Gruyter
2018-07-01
|
Series: | Journal of Intelligent Systems |
Subjects: | |
Online Access: | https://doi.org/10.1515/jisys-2016-0111 |
id |
doaj-f59cc9e5a98343f1a528a51e407c8556 |
---|---|
record_format |
Article |
spelling |
doaj-f59cc9e5a98343f1a528a51e407c85562021-09-06T19:40:37ZengDe GruyterJournal of Intelligent Systems0334-18602191-026X2018-07-0127334936210.1515/jisys-2016-0111Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search OptimizationLakshmanna Kuruva0Khare Neelu1VIT University, Vellore, Tamil Nadu 632014, IndiaVIT University, Vellore, Tamil Nadu 632014, IndiaDNA sequence mining is essential in the study of the structure and function of the DNA sequence. A few exploration works have been published in the literature concerning sequence mining in information mining task. Similarly, in our past paper, an effective sequence mining was performed on a DNA database utilizing constraint measures and group search optimization (GSO). In that study, GSO calculation was utilized to optimize the sequence extraction process from a given DNA database. However, it is apparent that, occasionally, such an arbitrary seeking system does not accompany the optimal solution in the given time. To overcome the problem, we proposed in this work multiple constraints with hybrid firefly and GSO (HFGSO) algorithm. The complete DNA sequence mining process comprised the following three modules: (i) applying prefix span algorithm; (ii) calculating the length, width, and regular expression (RE) constraints; and (iii) optimal mining via HFGSO. First, we apply the concept of prefix span, which detects the frequent DNA sequence pattern using a prefix tree. Based on this prefix tree, length, width, and RE constraints are applied to handle restrictions. Finally, we adopt the HFGSO algorithm for the completeness of the mining result. The experimentation is carried out on the standard DNA sequence dataset, and the evaluation with DNA sequence dataset and the results show that our approach is better than the existing approach.https://doi.org/10.1515/jisys-2016-0111prefix spanconstraintsregular expressiondna sequencemininghfgsoweightlength |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Lakshmanna Kuruva Khare Neelu |
spellingShingle |
Lakshmanna Kuruva Khare Neelu Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization Journal of Intelligent Systems prefix span constraints regular expression dna sequence mining hfgso weight length |
author_facet |
Lakshmanna Kuruva Khare Neelu |
author_sort |
Lakshmanna Kuruva |
title |
Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization |
title_short |
Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization |
title_full |
Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization |
title_fullStr |
Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization |
title_full_unstemmed |
Mining DNA Sequence Patterns with Constraints Using Hybridization of Firefly and Group Search Optimization |
title_sort |
mining dna sequence patterns with constraints using hybridization of firefly and group search optimization |
publisher |
De Gruyter |
series |
Journal of Intelligent Systems |
issn |
0334-1860 2191-026X |
publishDate |
2018-07-01 |
description |
DNA sequence mining is essential in the study of the structure and function of the DNA sequence. A few exploration works have been published in the literature concerning sequence mining in information mining task. Similarly, in our past paper, an effective sequence mining was performed on a DNA database utilizing constraint measures and group search optimization (GSO). In that study, GSO calculation was utilized to optimize the sequence extraction process from a given DNA database. However, it is apparent that, occasionally, such an arbitrary seeking system does not accompany the optimal solution in the given time. To overcome the problem, we proposed in this work multiple constraints with hybrid firefly and GSO (HFGSO) algorithm. The complete DNA sequence mining process comprised the following three modules: (i) applying prefix span algorithm; (ii) calculating the length, width, and regular expression (RE) constraints; and (iii) optimal mining via HFGSO. First, we apply the concept of prefix span, which detects the frequent DNA sequence pattern using a prefix tree. Based on this prefix tree, length, width, and RE constraints are applied to handle restrictions. Finally, we adopt the HFGSO algorithm for the completeness of the mining result. The experimentation is carried out on the standard DNA sequence dataset, and the evaluation with DNA sequence dataset and the results show that our approach is better than the existing approach. |
topic |
prefix span constraints regular expression dna sequence mining hfgso weight length |
url |
https://doi.org/10.1515/jisys-2016-0111 |
work_keys_str_mv |
AT lakshmannakuruva miningdnasequencepatternswithconstraintsusinghybridizationoffireflyandgroupsearchoptimization AT khareneelu miningdnasequencepatternswithconstraintsusinghybridizationoffireflyandgroupsearchoptimization |
_version_ |
1717768082570608640 |