An Experimental Study of Gene Team Problem
碩士 === 國立東華大學 === 資訊工程學系 === 98 === Gene teams (clusters) are groups of homologous genes which are colocated and con- tiguous along two or multiple genomes. Recently, gene teams have become a popular part of the field of comparative genomics to infer functional or evolutionary studies. According to...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2010
|
Online Access: | http://ndltd.ncl.edu.tw/handle/01877929717965752549 |
id |
ndltd-TW-098NDHU5392063 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098NDHU53920632016-04-22T04:23:11Z http://ndltd.ncl.edu.tw/handle/01877929717965752549 An Experimental Study of Gene Team Problem 基因群問題之實測研究 Wei-Hsin Wang 王偉信 碩士 國立東華大學 資訊工程學系 98 Gene teams (clusters) are groups of homologous genes which are colocated and con- tiguous along two or multiple genomes. Recently, gene teams have become a popular part of the field of comparative genomics to infer functional or evolutionary studies. According to the distance between the homologous genes close to each other (called δ value) and homologous type, the gene team finding problem can be classified into four subproblems, namely, common intervals in permutations, common intervals in sequences, max-gap clusters in permutations, and max-gap clusters in sequences. Here, value is used to be a gap-based criterion of extensible homologous gene clusters in the genomes. Besides, the homologous type determines what kind of genome models is used (e.g., for orthologs, we consider the homologous genes as unique letters co- occured in the different genomes and genomes are considered as permutations; for paralogs, we consider the homologous genes as families which may occur more than once in the genomes, and genomes are considered as sequences). Moreover, these tasks are usually called common intervals finding problems when is equal to zero (without gaps), and called max-gap clusters finding problems in the other way. Note that such problems are regardless of homologous genes’order in the genomes. In this thesis, we focus on the problem of max-gap clusters in permutations. We propose a bucket method to implement a straightforward algorithm for this problem. Though our algorithm has worst-case time complexity in O(n^2) where n is the number of genes in a genome, it outperforms an O(nlog^2 n) algorithm in practice. Sheng-Lung Peng 彭勝龍 2010 學位論文 ; thesis 39 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立東華大學 === 資訊工程學系 === 98 === Gene teams (clusters) are groups of homologous genes which are colocated and con-
tiguous along two or multiple genomes. Recently, gene teams have become a popular
part of the field of comparative genomics to infer functional or evolutionary studies.
According to the distance between the homologous genes close to each other (called
δ value) and homologous type, the gene team finding problem can be classified into
four subproblems, namely, common intervals in permutations, common intervals in
sequences, max-gap clusters in permutations, and max-gap clusters in sequences.
Here, value is used to be a gap-based criterion of extensible homologous gene clusters
in the genomes. Besides, the homologous type determines what kind of genome models
is used (e.g., for orthologs, we consider the homologous genes as unique letters co-
occured in the different genomes and genomes are considered as permutations; for
paralogs, we consider the homologous genes as families which may occur more than
once in the genomes, and genomes are considered as sequences). Moreover, these
tasks are usually called common intervals finding problems when is equal to zero
(without gaps), and called max-gap clusters finding problems in the other way. Note
that such problems are regardless of homologous genes’order in the genomes.
In this thesis, we focus on the problem of max-gap clusters in permutations. We
propose a bucket method to implement a straightforward algorithm for this problem.
Though our algorithm has worst-case time complexity in O(n^2) where n is the number
of genes in a genome, it outperforms an O(nlog^2 n) algorithm in practice.
|
author2 |
Sheng-Lung Peng |
author_facet |
Sheng-Lung Peng Wei-Hsin Wang 王偉信 |
author |
Wei-Hsin Wang 王偉信 |
spellingShingle |
Wei-Hsin Wang 王偉信 An Experimental Study of Gene Team Problem |
author_sort |
Wei-Hsin Wang |
title |
An Experimental Study of Gene Team Problem |
title_short |
An Experimental Study of Gene Team Problem |
title_full |
An Experimental Study of Gene Team Problem |
title_fullStr |
An Experimental Study of Gene Team Problem |
title_full_unstemmed |
An Experimental Study of Gene Team Problem |
title_sort |
experimental study of gene team problem |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/01877929717965752549 |
work_keys_str_mv |
AT weihsinwang anexperimentalstudyofgeneteamproblem AT wángwěixìn anexperimentalstudyofgeneteamproblem AT weihsinwang jīyīnqúnwèntízhīshícèyánjiū AT wángwěixìn jīyīnqúnwèntízhīshícèyánjiū AT weihsinwang experimentalstudyofgeneteamproblem AT wángwěixìn experimentalstudyofgeneteamproblem |
_version_ |
1718230371036823552 |