Detection of P53 Consensus Sequence: A Novel String Matching With Classes Algorithm
We present a novel fast string matching technique for special DNA pattern forms and compare performance of recent CPU architectures on the matching problem. In particular, we consider consensus P53 DNA-binding consensus sequence, which has an important contribution for cancer treatment. Based on bio...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Bursa Uludag University
2016-12-01
|
Series: | Uludağ University Journal of The Faculty of Engineering |
Subjects: | |
Online Access: | http://mmfdergi.uludag.edu.tr/article/view/5000183343 |
id |
doaj-1bb624e9c49b4ec2b80b76cca1b55357 |
---|---|
record_format |
Article |
spelling |
doaj-1bb624e9c49b4ec2b80b76cca1b553572020-11-25T03:29:22ZengBursa Uludag UniversityUludağ University Journal of The Faculty of Engineering2148-41472148-41552016-12-0121226928210.17482/uujfe.213855000171320Detection of P53 Consensus Sequence: A Novel String Matching With Classes AlgorithmGıyasettin ÖZCAN0Dumlupınar ÜniversitesiWe present a novel fast string matching technique for special DNA pattern forms and compare performance of recent CPU architectures on the matching problem. In particular, we consider consensus P53 DNA-binding consensus sequence, which has an important contribution for cancer treatment. Based on biological findings, consensus P53 pattern may emerge in various sequence forms and its length is not deterministic. Therefore, classic string matching algorithms are not able to solve the problem. For efficient solution, we consider bitwise string matching algorithms with classes and present a novel search technique which is based on 64-bit packed variables. In order to prevent obstacles based on variable length of the pattern, we search specific indexes of P53 on databases. For experimental analysis, we make use of mus musculus DNA sequences with approximately 2.3 billion nucleotides. We compare algorithm performance and three architectures with various level CPU parallelism. Test results show that our technique presents search efficiency during P53 pattern search in each architecture platform. Due to its structure, the algorithm also introduces an efficient solution to similar string matching with class problems.http://mmfdergi.uludag.edu.tr/article/view/5000183343Computer EngineeringComputational BiologyText ProcessingConsensus SequencesBitwise MatchingHardware Counters |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Gıyasettin ÖZCAN |
spellingShingle |
Gıyasettin ÖZCAN Detection of P53 Consensus Sequence: A Novel String Matching With Classes Algorithm Uludağ University Journal of The Faculty of Engineering Computer Engineering Computational Biology Text Processing Consensus Sequences Bitwise Matching Hardware Counters |
author_facet |
Gıyasettin ÖZCAN |
author_sort |
Gıyasettin ÖZCAN |
title |
Detection of P53 Consensus Sequence: A Novel String Matching With Classes Algorithm |
title_short |
Detection of P53 Consensus Sequence: A Novel String Matching With Classes Algorithm |
title_full |
Detection of P53 Consensus Sequence: A Novel String Matching With Classes Algorithm |
title_fullStr |
Detection of P53 Consensus Sequence: A Novel String Matching With Classes Algorithm |
title_full_unstemmed |
Detection of P53 Consensus Sequence: A Novel String Matching With Classes Algorithm |
title_sort |
detection of p53 consensus sequence: a novel string matching with classes algorithm |
publisher |
Bursa Uludag University |
series |
Uludağ University Journal of The Faculty of Engineering |
issn |
2148-4147 2148-4155 |
publishDate |
2016-12-01 |
description |
We present a novel fast string matching technique for special DNA pattern forms and compare performance of recent CPU architectures on the matching problem. In particular, we consider consensus P53 DNA-binding consensus sequence, which has an important contribution for cancer treatment. Based on biological findings, consensus P53 pattern may emerge in various sequence forms and its length is not deterministic. Therefore, classic string matching algorithms are not able to solve the problem. For efficient solution, we consider bitwise string matching algorithms with classes and present a novel search technique which is based on 64-bit packed variables. In order to prevent obstacles based on variable length of the pattern, we search specific indexes of P53 on databases. For experimental analysis, we make use of mus musculus DNA sequences with approximately 2.3 billion nucleotides. We compare algorithm performance and three architectures with various level CPU parallelism. Test results show that our technique presents search efficiency during P53 pattern search in each architecture platform. Due to its structure, the algorithm also introduces an efficient solution to similar string matching with class problems. |
topic |
Computer Engineering Computational Biology Text Processing Consensus Sequences Bitwise Matching Hardware Counters |
url |
http://mmfdergi.uludag.edu.tr/article/view/5000183343 |
work_keys_str_mv |
AT gıyasettinozcan detectionofp53consensussequenceanovelstringmatchingwithclassesalgorithm |
_version_ |
1724579736059904000 |