An Experimental Study of Gene Team Problem

碩士 === 國立東華大學 === 資訊工程學系 === 98 === Gene teams (clusters) are groups of homologous genes which are colocated and con- tiguous along two or multiple genomes. Recently, gene teams have become a popular part of the field of comparative genomics to infer functional or evolutionary studies. According to...

Full description

Bibliographic Details
Main Authors: Wei-Hsin Wang, 王偉信
Other Authors: Sheng-Lung Peng
Format: Others
Language:en_US
Published: 2010
Online Access:http://ndltd.ncl.edu.tw/handle/01877929717965752549
id ndltd-TW-098NDHU5392063
record_format oai_dc
spelling ndltd-TW-098NDHU53920632016-04-22T04:23:11Z http://ndltd.ncl.edu.tw/handle/01877929717965752549 An Experimental Study of Gene Team Problem 基因群問題之實測研究 Wei-Hsin Wang 王偉信 碩士 國立東華大學 資訊工程學系 98 Gene teams (clusters) are groups of homologous genes which are colocated and con- tiguous along two or multiple genomes. Recently, gene teams have become a popular part of the field of comparative genomics to infer functional or evolutionary studies. According to the distance between the homologous genes close to each other (called δ value) and homologous type, the gene team finding problem can be classified into four subproblems, namely, common intervals in permutations, common intervals in sequences, max-gap clusters in permutations, and max-gap clusters in sequences. Here, value is used to be a gap-based criterion of extensible homologous gene clusters in the genomes. Besides, the homologous type determines what kind of genome models is used (e.g., for orthologs, we consider the homologous genes as unique letters co- occured in the different genomes and genomes are considered as permutations; for paralogs, we consider the homologous genes as families which may occur more than once in the genomes, and genomes are considered as sequences). Moreover, these tasks are usually called common intervals finding problems when is equal to zero (without gaps), and called max-gap clusters finding problems in the other way. Note that such problems are regardless of homologous genes’order in the genomes. In this thesis, we focus on the problem of max-gap clusters in permutations. We propose a bucket method to implement a straightforward algorithm for this problem. Though our algorithm has worst-case time complexity in O(n^2) where n is the number of genes in a genome, it outperforms an O(nlog^2 n) algorithm in practice. Sheng-Lung Peng 彭勝龍 2010 學位論文 ; thesis 39 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立東華大學 === 資訊工程學系 === 98 === Gene teams (clusters) are groups of homologous genes which are colocated and con- tiguous along two or multiple genomes. Recently, gene teams have become a popular part of the field of comparative genomics to infer functional or evolutionary studies. According to the distance between the homologous genes close to each other (called δ value) and homologous type, the gene team finding problem can be classified into four subproblems, namely, common intervals in permutations, common intervals in sequences, max-gap clusters in permutations, and max-gap clusters in sequences. Here, value is used to be a gap-based criterion of extensible homologous gene clusters in the genomes. Besides, the homologous type determines what kind of genome models is used (e.g., for orthologs, we consider the homologous genes as unique letters co- occured in the different genomes and genomes are considered as permutations; for paralogs, we consider the homologous genes as families which may occur more than once in the genomes, and genomes are considered as sequences). Moreover, these tasks are usually called common intervals finding problems when is equal to zero (without gaps), and called max-gap clusters finding problems in the other way. Note that such problems are regardless of homologous genes’order in the genomes. In this thesis, we focus on the problem of max-gap clusters in permutations. We propose a bucket method to implement a straightforward algorithm for this problem. Though our algorithm has worst-case time complexity in O(n^2) where n is the number of genes in a genome, it outperforms an O(nlog^2 n) algorithm in practice.
author2 Sheng-Lung Peng
author_facet Sheng-Lung Peng
Wei-Hsin Wang
王偉信
author Wei-Hsin Wang
王偉信
spellingShingle Wei-Hsin Wang
王偉信
An Experimental Study of Gene Team Problem
author_sort Wei-Hsin Wang
title An Experimental Study of Gene Team Problem
title_short An Experimental Study of Gene Team Problem
title_full An Experimental Study of Gene Team Problem
title_fullStr An Experimental Study of Gene Team Problem
title_full_unstemmed An Experimental Study of Gene Team Problem
title_sort experimental study of gene team problem
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/01877929717965752549
work_keys_str_mv AT weihsinwang anexperimentalstudyofgeneteamproblem
AT wángwěixìn anexperimentalstudyofgeneteamproblem
AT weihsinwang jīyīnqúnwèntízhīshícèyánjiū
AT wángwěixìn jīyīnqúnwèntízhīshícèyánjiū
AT weihsinwang experimentalstudyofgeneteamproblem
AT wángwěixìn experimentalstudyofgeneteamproblem
_version_ 1718230371036823552