Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions
The purpose of this dissertation is to present a methodology to model global sequence alignment problem as directed acyclic graph which helps to extract all possible optimal alignments. Moreover, a mechanism to sequentially optimize sequence alignment problem relative to different cost functions is...
Main Author: | |
---|---|
Other Authors: | |
Language: | en |
Published: |
2013
|
Online Access: | http://hdl.handle.net/10754/271672 |
id |
ndltd-kaust.edu.sa-oai-repository.kaust.edu.sa-10754-271672 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-kaust.edu.sa-oai-repository.kaust.edu.sa-10754-2716722020-08-24T05:08:18Z Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions Odat, Enas M. Moshkov, Mikhail Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division Ravasi, Timothy Shihada, Basem The purpose of this dissertation is to present a methodology to model global sequence alignment problem as directed acyclic graph which helps to extract all possible optimal alignments. Moreover, a mechanism to sequentially optimize sequence alignment problem relative to different cost functions is suggested. Sequence alignment is mostly important in computational biology. It is used to find evolutionary relationships between biological sequences. There are many algo- rithms that have been developed to solve this problem. The most famous algorithms are Needleman-Wunsch and Smith-Waterman that are based on dynamic program- ming. In dynamic programming, problem is divided into a set of overlapping sub- problems and then the solution of each subproblem is found. Finally, the solutions to these subproblems are combined into a final solution. In this thesis it has been proved that for two sequences of length m and n over a fixed alphabet, the suggested optimization procedure requires O(mn) arithmetic operations per cost function on a single processor machine. The algorithm has been simulated using C#.Net programming language and a number of experiments have been done to verify the proved statements. The results of these experiments show that the number of optimal alignments is reduced after each step of optimization. Furthermore, it has been verified that as the sequence length increased linearly then the number of optimal alignments increased exponentially which also depends on the cost function that is used. Finally, the number of executed operations increases polynomially as the sequence length increase linearly. 2013-03-11T06:11:35Z 2015-01-01T00:00:00Z 2011-05 Thesis 10.25781/KAUST-42E8N http://hdl.handle.net/10754/271672 en 2015-01-01 At the time of archiving, the student author of this thesis opted to temporarily restrict access to it. The full text of this thesis became available to the public after the expiration of the embargo on 2015-01-01. |
collection |
NDLTD |
language |
en |
sources |
NDLTD |
description |
The purpose of this dissertation is to present a methodology to model global sequence alignment problem as directed acyclic graph which helps to extract all possible optimal alignments. Moreover, a mechanism to sequentially optimize sequence alignment problem relative to different cost functions is suggested.
Sequence alignment is mostly important in computational biology. It is used to find evolutionary relationships between biological sequences. There are many algo- rithms that have been developed to solve this problem. The most famous algorithms are Needleman-Wunsch and Smith-Waterman that are based on dynamic program- ming. In dynamic programming, problem is divided into a set of overlapping sub- problems and then the solution of each subproblem is found. Finally, the solutions to these subproblems are combined into a final solution.
In this thesis it has been proved that for two sequences of length m and n over a fixed alphabet, the suggested optimization procedure requires O(mn) arithmetic operations per cost function on a single processor machine.
The algorithm has been simulated using C#.Net programming language and a number of experiments have been done to verify the proved statements. The results of these experiments show that the number of optimal alignments is reduced after each step of optimization. Furthermore, it has been verified that as the sequence length increased linearly then the number of optimal alignments increased exponentially which also depends on the cost function that is used. Finally, the number of executed operations increases polynomially as the sequence length increase linearly. |
author2 |
Moshkov, Mikhail |
author_facet |
Moshkov, Mikhail Odat, Enas M. |
author |
Odat, Enas M. |
spellingShingle |
Odat, Enas M. Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions |
author_sort |
Odat, Enas M. |
title |
Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions |
title_short |
Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions |
title_full |
Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions |
title_fullStr |
Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions |
title_full_unstemmed |
Sequential Optimization of Global Sequence Alignments Relative to Different Cost Functions |
title_sort |
sequential optimization of global sequence alignments relative to different cost functions |
publishDate |
2013 |
url |
http://hdl.handle.net/10754/271672 |
work_keys_str_mv |
AT odatenasm sequentialoptimizationofglobalsequencealignmentsrelativetodifferentcostfunctions |
_version_ |
1719338860143443968 |