Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions

The purpose of this dissertation is to present a methodology to model global sequence alignment problem as directed acyclic graph which helps to extract all possible optimal alignments. Moreover, a mechanism to sequentially optimize sequence alignment problem relative to different cost functions is...

Full description

Bibliographic Details
Main Author: Odat, Enas M.
Other Authors: Moshkov, Mikhail
Language:en
Published: 2013
Online Access:http://hdl.handle.net/10754/271672
id ndltd-kaust.edu.sa-oai-repository.kaust.edu.sa-10754-271672
record_format oai_dc
spelling ndltd-kaust.edu.sa-oai-repository.kaust.edu.sa-10754-2716722020-08-24T05:08:18Z Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions Odat, Enas M. Moshkov, Mikhail Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division Ravasi, Timothy Shihada, Basem The purpose of this dissertation is to present a methodology to model global sequence alignment problem as directed acyclic graph which helps to extract all possible optimal alignments. Moreover, a mechanism to sequentially optimize sequence alignment problem relative to different cost functions is suggested. Sequence alignment is mostly important in computational biology. It is used to find evolutionary relationships between biological sequences. There are many algo- rithms that have been developed to solve this problem. The most famous algorithms are Needleman-Wunsch and Smith-Waterman that are based on dynamic program- ming. In dynamic programming, problem is divided into a set of overlapping sub- problems and then the solution of each subproblem is found. Finally, the solutions to these subproblems are combined into a final solution. In this thesis it has been proved that for two sequences of length m and n over a fixed alphabet, the suggested optimization procedure requires O(mn) arithmetic operations per cost function on a single processor machine. The algorithm has been simulated using C#.Net programming language and a number of experiments have been done to verify the proved statements. The results of these experiments show that the number of optimal alignments is reduced after each step of optimization. Furthermore, it has been verified that as the sequence length increased linearly then the number of optimal alignments increased exponentially which also depends on the cost function that is used. Finally, the number of executed operations increases polynomially as the sequence length increase linearly. 2013-03-11T06:11:35Z 2015-01-01T00:00:00Z 2011-05 Thesis 10.25781/KAUST-42E8N http://hdl.handle.net/10754/271672 en 2015-01-01 At the time of archiving, the student author of this thesis opted to temporarily restrict access to it. The full text of this thesis became available to the public after the expiration of the embargo on 2015-01-01.
collection NDLTD
language en
sources NDLTD
description The purpose of this dissertation is to present a methodology to model global sequence alignment problem as directed acyclic graph which helps to extract all possible optimal alignments. Moreover, a mechanism to sequentially optimize sequence alignment problem relative to different cost functions is suggested. Sequence alignment is mostly important in computational biology. It is used to find evolutionary relationships between biological sequences. There are many algo- rithms that have been developed to solve this problem. The most famous algorithms are Needleman-Wunsch and Smith-Waterman that are based on dynamic program- ming. In dynamic programming, problem is divided into a set of overlapping sub- problems and then the solution of each subproblem is found. Finally, the solutions to these subproblems are combined into a final solution. In this thesis it has been proved that for two sequences of length m and n over a fixed alphabet, the suggested optimization procedure requires O(mn) arithmetic operations per cost function on a single processor machine. The algorithm has been simulated using C#.Net programming language and a number of experiments have been done to verify the proved statements. The results of these experiments show that the number of optimal alignments is reduced after each step of optimization. Furthermore, it has been verified that as the sequence length increased linearly then the number of optimal alignments increased exponentially which also depends on the cost function that is used. Finally, the number of executed operations increases polynomially as the sequence length increase linearly.
author2 Moshkov, Mikhail
author_facet Moshkov, Mikhail
Odat, Enas M.
author Odat, Enas M.
spellingShingle Odat, Enas M.
Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions
author_sort Odat, Enas M.
title Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions
title_short Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions
title_full Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions
title_fullStr Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions
title_full_unstemmed Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions
title_sort sequential optimization of global sequence alignments relative to different cost functions
publishDate 2013
url http://hdl.handle.net/10754/271672
work_keys_str_mv AT odatenasm sequentialoptimizationofglobalsequencealignmentsrelativetodifferentcostfunctions
_version_ 1719338860143443968