An Experimental Study of Gene Team Problem

Master's === National Dong Hwa University === Department of Computer Science and Information Engineering === 98 === Gene teams (clusters) are groups of homologous genes that are co-located and contiguous along two or more genomes. Recently, gene teams have become a popular topic in comparative genomics for functional and evolutionary studies. According to...


Bibliographic Details
Main Authors:	Wei-Hsin Wang, 王偉信
Other Authors:	Sheng-Lung Peng
Format:	Others
Language:	en_US
Published:	2010
Online Access:	http://ndltd.ncl.edu.tw/handle/01877929717965752549

id	ndltd-TW-098NDHU5392063
record_format	oai_dc
spelling	ndltd-TW-098NDHU5392063 2016-04-22T04:23:11Z http://ndltd.ncl.edu.tw/handle/01877929717965752549 An Experimental Study of Gene Team Problem 基因群問題之實測研究 Wei-Hsin Wang 王偉信 Master's National Dong Hwa University Department of Computer Science and Information Engineering 98 Gene teams (clusters) are groups of homologous genes that are co-located and contiguous along two or more genomes. Recently, gene teams have become a popular topic in comparative genomics for functional and evolutionary studies. According to the distance allowed between adjacent homologous genes (called the δ value) and the homology type, the gene team finding problem can be classified into four subproblems: common intervals in permutations, common intervals in sequences, max-gap clusters in permutations, and max-gap clusters in sequences. Here, the δ value serves as a gap-based criterion for extending homologous gene clusters in the genomes. The homology type determines which genome model is used: for orthologs, homologous genes are treated as unique letters co-occurring in the different genomes, and genomes are modeled as permutations; for paralogs, homologous genes are treated as families that may occur more than once in a genome, and genomes are modeled as sequences. These tasks are usually called common-interval finding problems when δ is equal to zero (without gaps), and max-gap cluster finding problems otherwise. Note that these problems disregard the order of the homologous genes in the genomes. In this thesis, we focus on the problem of max-gap clusters in permutations. We propose a bucket method to implement a straightforward algorithm for this problem. Although our algorithm has a worst-case time complexity of O(n^2), where n is the number of genes in a genome, it outperforms an O(n log^2 n) algorithm in practice. Sheng-Lung Peng 彭勝龍 2010 學位論文 ; thesis 39 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	Master's === National Dong Hwa University === Department of Computer Science and Information Engineering === 98 === Gene teams (clusters) are groups of homologous genes that are co-located and contiguous along two or more genomes. Recently, gene teams have become a popular topic in comparative genomics for functional and evolutionary studies. According to the distance allowed between adjacent homologous genes (called the δ value) and the homology type, the gene team finding problem can be classified into four subproblems: common intervals in permutations, common intervals in sequences, max-gap clusters in permutations, and max-gap clusters in sequences. Here, the δ value serves as a gap-based criterion for extending homologous gene clusters in the genomes. The homology type determines which genome model is used: for orthologs, homologous genes are treated as unique letters co-occurring in the different genomes, and genomes are modeled as permutations; for paralogs, homologous genes are treated as families that may occur more than once in a genome, and genomes are modeled as sequences. These tasks are usually called common-interval finding problems when δ is equal to zero (without gaps), and max-gap cluster finding problems otherwise. Note that these problems disregard the order of the homologous genes in the genomes. In this thesis, we focus on the problem of max-gap clusters in permutations. We propose a bucket method to implement a straightforward algorithm for this problem. Although our algorithm has a worst-case time complexity of O(n^2), where n is the number of genes in a genome, it outperforms an O(n log^2 n) algorithm in practice.
author2	Sheng-Lung Peng
author_facet	Sheng-Lung Peng Wei-Hsin Wang 王偉信
author	Wei-Hsin Wang 王偉信
spellingShingle	Wei-Hsin Wang 王偉信 An Experimental Study of Gene Team Problem
author_sort	Wei-Hsin Wang
title	An Experimental Study of Gene Team Problem
title_short	An Experimental Study of Gene Team Problem
title_full	An Experimental Study of Gene Team Problem
title_fullStr	An Experimental Study of Gene Team Problem
title_full_unstemmed	An Experimental Study of Gene Team Problem
title_sort	experimental study of gene team problem
publishDate	2010
url	http://ndltd.ncl.edu.tw/handle/01877929717965752549
work_keys_str_mv	AT weihsinwang anexperimentalstudyofgeneteamproblem AT wángwěixìn anexperimentalstudyofgeneteamproblem AT weihsinwang jīyīnqúnwèntízhīshícèyánjiū AT wángwěixìn jīyīnqúnwèntízhīshícèyánjiū AT weihsinwang experimentalstudyofgeneteamproblem AT wángwěixìn experimentalstudyofgeneteamproblem
_version_	1718230371036823552
