Extracting Network Preferential Summary with Bootstrapping Method
碩士 === 國立成功大學 === 資訊管理研究所 === 103 === The output value of e-commerce has obviously growing in 2008. Consumers have most interest in discount and preferential information. It’s difficult for search engine to keep latest and the most comprehensive search result. This research use bootstrapping method...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2015
|
Online Access: | http://ndltd.ncl.edu.tw/handle/55868538860016135710 |
id |
ndltd-TW-103NCKU5396007 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-103NCKU53960072016-05-22T04:40:55Z http://ndltd.ncl.edu.tw/handle/55868538860016135710 Extracting Network Preferential Summary with Bootstrapping Method 以Bootstrapping方法萃取網路優惠摘要 Yan-FuCheng 程彥輔 碩士 國立成功大學 資訊管理研究所 103 The output value of e-commerce has obviously growing in 2008. Consumers have most interest in discount and preferential information. It’s difficult for search engine to keep latest and the most comprehensive search result. This research use bootstrapping method with text mining. After determine preferential keyword, set the website that has complete preferential information as seed pages. Finding document object model (DOM) position of preferential information with XML path language (XPath) to get the pattern that can extract preferential information. The pattern will download webpages from chosen websites. Analyzing these pages with word segmentation system and Distance Point-Wise Mutual Information (DPMI), learning new preferential keywords with bootstrapping method. Combine preferential keyword and store or product name for search engine to find out new preferential websites. Developing a user interface which provides preferential information like: buy one get one, buy one, get one half price, etc. Experiment result shows that DPMI using two as word distance has the greatest precision 29.4%, 9.4% higher than PMI’s result 20%. Hei-Chia Wang 王惠嘉 2015 學位論文 ; thesis 56 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立成功大學 === 資訊管理研究所 === 103 === The output value of e-commerce has obviously growing in 2008. Consumers have most interest in discount and preferential information. It’s difficult for search engine to keep latest and the most comprehensive search result.
This research use bootstrapping method with text mining. After determine preferential keyword, set the website that has complete preferential information as seed pages. Finding document object model (DOM) position of preferential information with XML path language (XPath) to get the pattern that can extract preferential information. The pattern will download webpages from chosen websites. Analyzing these pages with word segmentation system and Distance Point-Wise Mutual Information (DPMI), learning new preferential keywords with bootstrapping method. Combine preferential keyword and store or product name for search engine to find out new preferential websites. Developing a user interface which provides preferential information like: buy one get one, buy one, get one half price, etc.
Experiment result shows that DPMI using two as word distance has the greatest precision 29.4%, 9.4% higher than PMI’s result 20%.
|
author2 |
Hei-Chia Wang |
author_facet |
Hei-Chia Wang Yan-FuCheng 程彥輔 |
author |
Yan-FuCheng 程彥輔 |
spellingShingle |
Yan-FuCheng 程彥輔 Extracting Network Preferential Summary with Bootstrapping Method |
author_sort |
Yan-FuCheng |
title |
Extracting Network Preferential Summary with Bootstrapping Method |
title_short |
Extracting Network Preferential Summary with Bootstrapping Method |
title_full |
Extracting Network Preferential Summary with Bootstrapping Method |
title_fullStr |
Extracting Network Preferential Summary with Bootstrapping Method |
title_full_unstemmed |
Extracting Network Preferential Summary with Bootstrapping Method |
title_sort |
extracting network preferential summary with bootstrapping method |
publishDate |
2015 |
url |
http://ndltd.ncl.edu.tw/handle/55868538860016135710 |
work_keys_str_mv |
AT yanfucheng extractingnetworkpreferentialsummarywithbootstrappingmethod AT chéngyànfǔ extractingnetworkpreferentialsummarywithbootstrappingmethod AT yanfucheng yǐbootstrappingfāngfǎcuìqǔwǎnglùyōuhuìzhāiyào AT chéngyànfǔ yǐbootstrappingfāngfǎcuìqǔwǎnglùyōuhuìzhāiyào |
_version_ |
1718277151686393856 |