Detection of P53 Consensus Sequence: A Novel String Matching With Classes Algorithm

We present a novel fast string matching technique for special DNA pattern forms and compare performance of recent CPU architectures on the matching problem. In particular, we consider consensus P53 DNA-binding consensus sequence, which has an important contribution for cancer treatment. Based on bio...

Full description

Bibliographic Details
Main Author: Gıyasettin ÖZCAN
Format: Article
Language:English
Published: Bursa Uludag University 2016-12-01
Series:Uludağ University Journal of The Faculty of Engineering
Subjects:
Online Access:http://mmfdergi.uludag.edu.tr/article/view/5000183343
id doaj-1bb624e9c49b4ec2b80b76cca1b55357
record_format Article
spelling doaj-1bb624e9c49b4ec2b80b76cca1b553572020-11-25T03:29:22ZengBursa Uludag UniversityUludağ University Journal of The Faculty of Engineering2148-41472148-41552016-12-0121226928210.17482/uujfe.213855000171320Detection of P53 Consensus Sequence: A Novel String Matching With Classes AlgorithmGıyasettin ÖZCAN0Dumlupınar ÜniversitesiWe present a novel fast string matching technique for special DNA pattern forms and compare performance of recent CPU architectures on the matching problem. In particular, we consider consensus P53 DNA-binding consensus sequence, which has an important contribution for cancer treatment. Based on biological findings, consensus P53 pattern may emerge in various sequence forms and its length is not deterministic. Therefore, classic string matching algorithms are not able to solve the problem. For efficient solution, we consider bitwise string matching algorithms with classes and present a novel search technique which is based on 64-bit packed variables. In order to prevent obstacles based on variable length of the pattern, we search specific indexes of P53 on databases. For experimental analysis, we make use of mus musculus DNA sequences with approximately 2.3 billion nucleotides. We compare algorithm performance and three architectures with various level CPU parallelism. Test results show that our technique presents search efficiency during P53 pattern search in each architecture platform. Due to its structure, the algorithm also introduces an efficient solution to similar string matching with class problems.http://mmfdergi.uludag.edu.tr/article/view/5000183343Computer EngineeringComputational BiologyText ProcessingConsensus SequencesBitwise MatchingHardware Counters
collection DOAJ
language English
format Article
sources DOAJ
author Gıyasettin ÖZCAN
spellingShingle Gıyasettin ÖZCAN
Detection of P53 Consensus Sequence: A Novel String Matching With Classes Algorithm
Uludağ University Journal of The Faculty of Engineering
Computer Engineering
Computational Biology
Text Processing
Consensus Sequences
Bitwise Matching
Hardware Counters
author_facet Gıyasettin ÖZCAN
author_sort Gıyasettin ÖZCAN
title Detection of P53 Consensus Sequence: A Novel String Matching With Classes Algorithm
title_short Detection of P53 Consensus Sequence: A Novel String Matching With Classes Algorithm
title_full Detection of P53 Consensus Sequence: A Novel String Matching With Classes Algorithm
title_fullStr Detection of P53 Consensus Sequence: A Novel String Matching With Classes Algorithm
title_full_unstemmed Detection of P53 Consensus Sequence: A Novel String Matching With Classes Algorithm
title_sort detection of p53 consensus sequence: a novel string matching with classes algorithm
publisher Bursa Uludag University
series Uludağ University Journal of The Faculty of Engineering
issn 2148-4147
2148-4155
publishDate 2016-12-01
description We present a novel fast string matching technique for special DNA pattern forms and compare performance of recent CPU architectures on the matching problem. In particular, we consider consensus P53 DNA-binding consensus sequence, which has an important contribution for cancer treatment. Based on biological findings, consensus P53 pattern may emerge in various sequence forms and its length is not deterministic. Therefore, classic string matching algorithms are not able to solve the problem. For efficient solution, we consider bitwise string matching algorithms with classes and present a novel search technique which is based on 64-bit packed variables. In order to prevent obstacles based on variable length of the pattern, we search specific indexes of P53 on databases. For experimental analysis, we make use of mus musculus DNA sequences with approximately 2.3 billion nucleotides. We compare algorithm performance and three architectures with various level CPU parallelism. Test results show that our technique presents search efficiency during P53 pattern search in each architecture platform. Due to its structure, the algorithm also introduces an efficient solution to similar string matching with class problems.
topic Computer Engineering
Computational Biology
Text Processing
Consensus Sequences
Bitwise Matching
Hardware Counters
url http://mmfdergi.uludag.edu.tr/article/view/5000183343
work_keys_str_mv AT gıyasettinozcan detectionofp53consensussequenceanovelstringmatchingwithclassesalgorithm
_version_ 1724579736059904000