Data Mining for Regulatory Elements in Repeat Sequences

碩士 === 國立中央大學 === 資訊工程研究所 === 88 === Human Genome Project began at 1988 and then lots of genomes will be sequencialized later. Repeat sequences in genome sequences play an important role in medical diagnosis and research. The Transcription factor database TRANSFAC collects many promoter classes. In...

Full description

Bibliographic Details
Main Authors: Wen-Fu Cho, 卓文福
Other Authors: Jorng-Tzong Horng
Format: Others
Language:en_US
Published: 2000
Online Access:http://ndltd.ncl.edu.tw/handle/95026997305077670391
id ndltd-TW-088NCU00392007
record_format oai_dc
spelling ndltd-TW-088NCU003920072016-07-08T04:22:42Z http://ndltd.ncl.edu.tw/handle/95026997305077670391 Data Mining for Regulatory Elements in Repeat Sequences 應用資料採礦於基因體之重複序列資料庫 Wen-Fu Cho 卓文福 碩士 國立中央大學 資訊工程研究所 88 Human Genome Project began at 1988 and then lots of genomes will be sequencialized later. Repeat sequences in genome sequences play an important role in medical diagnosis and research. The Transcription factor database TRANSFAC collects many promoter classes. In this thesis, we first mark the transcription factor binding sites in the repeat sequences and then apply data mining techniques to mine the association rules from the combinations of binding sites. We further prune the discovered associations to remove those insignificant associations and find a set of useful rules. Finally, we use the discovered association rules to partially classify the repeat sequences in our repeat database. We also experiment on several genomes including C.Elegans, Human Chromosome 22, and Yeast. Jorng-Tzong Horng 洪炯宗 2000 學位論文 ; thesis 38 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立中央大學 === 資訊工程研究所 === 88 === Human Genome Project began at 1988 and then lots of genomes will be sequencialized later. Repeat sequences in genome sequences play an important role in medical diagnosis and research. The Transcription factor database TRANSFAC collects many promoter classes. In this thesis, we first mark the transcription factor binding sites in the repeat sequences and then apply data mining techniques to mine the association rules from the combinations of binding sites. We further prune the discovered associations to remove those insignificant associations and find a set of useful rules. Finally, we use the discovered association rules to partially classify the repeat sequences in our repeat database. We also experiment on several genomes including C.Elegans, Human Chromosome 22, and Yeast.
author2 Jorng-Tzong Horng
author_facet Jorng-Tzong Horng
Wen-Fu Cho
卓文福
author Wen-Fu Cho
卓文福
spellingShingle Wen-Fu Cho
卓文福
Data Mining for Regulatory Elements in Repeat Sequences
author_sort Wen-Fu Cho
title Data Mining for Regulatory Elements in Repeat Sequences
title_short Data Mining for Regulatory Elements in Repeat Sequences
title_full Data Mining for Regulatory Elements in Repeat Sequences
title_fullStr Data Mining for Regulatory Elements in Repeat Sequences
title_full_unstemmed Data Mining for Regulatory Elements in Repeat Sequences
title_sort data mining for regulatory elements in repeat sequences
publishDate 2000
url http://ndltd.ncl.edu.tw/handle/95026997305077670391
work_keys_str_mv AT wenfucho dataminingforregulatoryelementsinrepeatsequences
AT zhuōwénfú dataminingforregulatoryelementsinrepeatsequences
AT wenfucho yīngyòngzīliàocǎikuàngyújīyīntǐzhīzhòngfùxùlièzīliàokù
AT zhuōwénfú yīngyòngzīliàocǎikuàngyújīyīntǐzhīzhòngfùxùlièzīliàokù
_version_ 1718339961865895936