Data Mining for Regulatory Elements in Repeat Sequences
碩士 === 國立中央大學 === 資訊工程研究所 === 88 === Human Genome Project began at 1988 and then lots of genomes will be sequencialized later. Repeat sequences in genome sequences play an important role in medical diagnosis and research. The Transcription factor database TRANSFAC collects many promoter classes. In...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2000
|
Online Access: | http://ndltd.ncl.edu.tw/handle/95026997305077670391 |
id |
ndltd-TW-088NCU00392007 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-088NCU003920072016-07-08T04:22:42Z http://ndltd.ncl.edu.tw/handle/95026997305077670391 Data Mining for Regulatory Elements in Repeat Sequences 應用資料採礦於基因體之重複序列資料庫 Wen-Fu Cho 卓文福 碩士 國立中央大學 資訊工程研究所 88 Human Genome Project began at 1988 and then lots of genomes will be sequencialized later. Repeat sequences in genome sequences play an important role in medical diagnosis and research. The Transcription factor database TRANSFAC collects many promoter classes. In this thesis, we first mark the transcription factor binding sites in the repeat sequences and then apply data mining techniques to mine the association rules from the combinations of binding sites. We further prune the discovered associations to remove those insignificant associations and find a set of useful rules. Finally, we use the discovered association rules to partially classify the repeat sequences in our repeat database. We also experiment on several genomes including C.Elegans, Human Chromosome 22, and Yeast. Jorng-Tzong Horng 洪炯宗 2000 學位論文 ; thesis 38 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中央大學 === 資訊工程研究所 === 88 === Human Genome Project began at 1988 and then lots of genomes will be sequencialized later. Repeat sequences in genome sequences play an important role in medical diagnosis and research. The Transcription factor database TRANSFAC collects many promoter classes. In this thesis, we first mark the transcription factor binding sites in the repeat sequences and then apply data mining techniques to mine the association rules from the combinations of binding sites. We further prune the discovered associations to remove those insignificant associations and find a set of useful rules. Finally, we use the discovered association rules to partially classify the repeat sequences in our repeat database. We also experiment on several genomes including C.Elegans, Human Chromosome 22, and Yeast.
|
author2 |
Jorng-Tzong Horng |
author_facet |
Jorng-Tzong Horng Wen-Fu Cho 卓文福 |
author |
Wen-Fu Cho 卓文福 |
spellingShingle |
Wen-Fu Cho 卓文福 Data Mining for Regulatory Elements in Repeat Sequences |
author_sort |
Wen-Fu Cho |
title |
Data Mining for Regulatory Elements in Repeat Sequences |
title_short |
Data Mining for Regulatory Elements in Repeat Sequences |
title_full |
Data Mining for Regulatory Elements in Repeat Sequences |
title_fullStr |
Data Mining for Regulatory Elements in Repeat Sequences |
title_full_unstemmed |
Data Mining for Regulatory Elements in Repeat Sequences |
title_sort |
data mining for regulatory elements in repeat sequences |
publishDate |
2000 |
url |
http://ndltd.ncl.edu.tw/handle/95026997305077670391 |
work_keys_str_mv |
AT wenfucho dataminingforregulatoryelementsinrepeatsequences AT zhuōwénfú dataminingforregulatoryelementsinrepeatsequences AT wenfucho yīngyòngzīliàocǎikuàngyújīyīntǐzhīzhòngfùxùlièzīliàokù AT zhuōwénfú yīngyòngzīliàocǎikuàngyújīyīntǐzhīzhòngfùxùlièzīliàokù |
_version_ |
1718339961865895936 |