Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants
碩士 === 元智大學 === 生物與醫學資訊碩士學位學程 === 99 === Promoter located in the DNA sequence, it can recruits RNA polymerase to bind the upstream of gene and start to transcribe. The promoter is different from sequence or structural even the nucleotides in the folding sequences, it produced energy is different to...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2011
|
Online Access: | http://ndltd.ncl.edu.tw/handle/88877225143749632122 |
id |
ndltd-TW-099YZU05114001 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-099YZU051140012016-04-13T04:16:57Z http://ndltd.ncl.edu.tw/handle/88877225143749632122 Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants 應用跨物種保留序列於辨別植物啟動子 Hsing-Juei Huang 黃祥瑞 碩士 元智大學 生物與醫學資訊碩士學位學程 99 Promoter located in the DNA sequence, it can recruits RNA polymerase to bind the upstream of gene and start to transcribe. The promoter is different from sequence or structural even the nucleotides in the folding sequences, it produced energy is different to the other sequences which is non promoter sequences in DNA. Correctly identify promoter location is become a important issue in genome biology. We focus on Arabidopsis Thaliana in this experiment, includes 33200 gene models, and construct experimental dataset after data filtered. And use orthologous sequences in Arabidopsis Thaliana and Oryza sativa to find conserved regions in evolution. We use 6,8,10-mer sequences in conserved regions and combine Match(motif finder) for training features. Finally, we use classifier QuickRBF and 5 fold cross-validation to construct the prediction model. In unbalanced dataset, we still can achieve 78.02% balance accuracy in independent data. Ya-Ting Chao 趙雅婷 2011 學位論文 ; thesis 34 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 元智大學 === 生物與醫學資訊碩士學位學程 === 99 === Promoter located in the DNA sequence, it can recruits RNA polymerase to bind the upstream of gene and start to transcribe. The promoter is different from sequence or structural even the nucleotides in the folding sequences, it produced energy is different to the other sequences which is non promoter sequences in DNA. Correctly identify promoter location is become a important issue in genome biology. We focus on Arabidopsis Thaliana in this experiment, includes 33200 gene models, and construct experimental dataset after data filtered. And use orthologous sequences in Arabidopsis Thaliana and Oryza sativa to find conserved regions in evolution. We use 6,8,10-mer sequences in conserved regions and combine Match(motif finder) for training features.
Finally, we use classifier QuickRBF and 5 fold cross-validation to construct the prediction model. In unbalanced dataset, we still can achieve 78.02% balance accuracy in independent data.
|
author2 |
Ya-Ting Chao |
author_facet |
Ya-Ting Chao Hsing-Juei Huang 黃祥瑞 |
author |
Hsing-Juei Huang 黃祥瑞 |
spellingShingle |
Hsing-Juei Huang 黃祥瑞 Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants |
author_sort |
Hsing-Juei Huang |
title |
Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants |
title_short |
Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants |
title_full |
Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants |
title_fullStr |
Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants |
title_full_unstemmed |
Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants |
title_sort |
exploiting cross-species conserved oligonucleotides to identify gene promoters in plants |
publishDate |
2011 |
url |
http://ndltd.ncl.edu.tw/handle/88877225143749632122 |
work_keys_str_mv |
AT hsingjueihuang exploitingcrossspeciesconservedoligonucleotidestoidentifygenepromotersinplants AT huángxiángruì exploitingcrossspeciesconservedoligonucleotidestoidentifygenepromotersinplants AT hsingjueihuang yīngyòngkuàwùzhǒngbǎoliúxùlièyúbiànbiézhíwùqǐdòngzi AT huángxiángruì yīngyòngkuàwùzhǒngbǎoliúxùlièyúbiànbiézhíwùqǐdòngzi |
_version_ |
1718222470639517696 |