Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants

碩士 === 元智大學 === 生物與醫學資訊碩士學位學程 === 99 === Promoter located in the DNA sequence, it can recruits RNA polymerase to bind the upstream of gene and start to transcribe. The promoter is different from sequence or structural even the nucleotides in the folding sequences, it produced energy is different to...

Full description

Bibliographic Details
Main Authors: Hsing-Juei Huang, 黃祥瑞
Other Authors: Ya-Ting Chao
Format: Others
Language:zh-TW
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/88877225143749632122
id ndltd-TW-099YZU05114001
record_format oai_dc
spelling ndltd-TW-099YZU051140012016-04-13T04:16:57Z http://ndltd.ncl.edu.tw/handle/88877225143749632122 Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants 應用跨物種保留序列於辨別植物啟動子 Hsing-Juei Huang 黃祥瑞 碩士 元智大學 生物與醫學資訊碩士學位學程 99 Promoter located in the DNA sequence, it can recruits RNA polymerase to bind the upstream of gene and start to transcribe. The promoter is different from sequence or structural even the nucleotides in the folding sequences, it produced energy is different to the other sequences which is non promoter sequences in DNA. Correctly identify promoter location is become a important issue in genome biology. We focus on Arabidopsis Thaliana in this experiment, includes 33200 gene models, and construct experimental dataset after data filtered. And use orthologous sequences in Arabidopsis Thaliana and Oryza sativa to find conserved regions in evolution. We use 6,8,10-mer sequences in conserved regions and combine Match(motif finder) for training features. Finally, we use classifier QuickRBF and 5 fold cross-validation to construct the prediction model. In unbalanced dataset, we still can achieve 78.02% balance accuracy in independent data. Ya-Ting Chao 趙雅婷 2011 學位論文 ; thesis 34 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 元智大學 === 生物與醫學資訊碩士學位學程 === 99 === Promoter located in the DNA sequence, it can recruits RNA polymerase to bind the upstream of gene and start to transcribe. The promoter is different from sequence or structural even the nucleotides in the folding sequences, it produced energy is different to the other sequences which is non promoter sequences in DNA. Correctly identify promoter location is become a important issue in genome biology. We focus on Arabidopsis Thaliana in this experiment, includes 33200 gene models, and construct experimental dataset after data filtered. And use orthologous sequences in Arabidopsis Thaliana and Oryza sativa to find conserved regions in evolution. We use 6,8,10-mer sequences in conserved regions and combine Match(motif finder) for training features. Finally, we use classifier QuickRBF and 5 fold cross-validation to construct the prediction model. In unbalanced dataset, we still can achieve 78.02% balance accuracy in independent data.
author2 Ya-Ting Chao
author_facet Ya-Ting Chao
Hsing-Juei Huang
黃祥瑞
author Hsing-Juei Huang
黃祥瑞
spellingShingle Hsing-Juei Huang
黃祥瑞
Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants
author_sort Hsing-Juei Huang
title Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants
title_short Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants
title_full Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants
title_fullStr Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants
title_full_unstemmed Exploiting cross-species conserved oligonucleotides to identify gene promoters in plants
title_sort exploiting cross-species conserved oligonucleotides to identify gene promoters in plants
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/88877225143749632122
work_keys_str_mv AT hsingjueihuang exploitingcrossspeciesconservedoligonucleotidestoidentifygenepromotersinplants
AT huángxiángruì exploitingcrossspeciesconservedoligonucleotidestoidentifygenepromotersinplants
AT hsingjueihuang yīngyòngkuàwùzhǒngbǎoliúxùlièyúbiànbiézhíwùqǐdòngzi
AT huángxiángruì yīngyòngkuàwùzhǒngbǎoliúxùlièyúbiànbiézhíwùqǐdòngzi
_version_ 1718222470639517696