Automatic Identification of Data Blocks based on Web Page Structure
碩士 === 淡江大學 === 資訊工程學系碩士在職專班 === 98 === The internet has been a major source of information. It has taken the place of paper and become the most popular medium, such as: News web sites. Therefore, developing an automatic data collection technology is very important. At present the Really Simple Syn...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2010
|
Online Access: | http://ndltd.ncl.edu.tw/handle/34610846490727929812 |
id |
ndltd-TW-098TKU05392006 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098TKU053920062015-10-13T13:40:01Z http://ndltd.ncl.edu.tw/handle/34610846490727929812 Automatic Identification of Data Blocks based on Web Page Structure 植基於網頁結構的資料區塊化自動分類 Yi-Chen Liso 廖益辰 碩士 淡江大學 資訊工程學系碩士在職專班 98 The internet has been a major source of information. It has taken the place of paper and become the most popular medium, such as: News web sites. Therefore, developing an automatic data collection technology is very important. At present the Really Simple Syndication (RSS) is a general of data collection method for the users. Besides, it is use the specific program analysis web page structures to obtain the web page information. When the web page changed, the program must be rewritten. Therefore, this paper provides an automated analysis web page structure method. Using this method find the web page pattern and approved it can be the rule. It has been tested in automatic collection of web page data. Yih-Jia Tsai 蔡憶佳 2010 學位論文 ; thesis 55 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 淡江大學 === 資訊工程學系碩士在職專班 === 98 === The internet has been a major source of information. It has taken the place of paper and become the most popular medium, such as: News web sites. Therefore, developing an automatic data collection technology is very important.
At present the Really Simple Syndication (RSS) is a general of data collection method for the users. Besides, it is use the specific program analysis web page structures to obtain the web page information. When the web page changed, the program must be rewritten. Therefore, this paper provides an automated analysis web page structure method. Using this method find the web page pattern and approved it can be the rule. It has been tested in automatic collection of web page data.
|
author2 |
Yih-Jia Tsai |
author_facet |
Yih-Jia Tsai Yi-Chen Liso 廖益辰 |
author |
Yi-Chen Liso 廖益辰 |
spellingShingle |
Yi-Chen Liso 廖益辰 Automatic Identification of Data Blocks based on Web Page Structure |
author_sort |
Yi-Chen Liso |
title |
Automatic Identification of Data Blocks based on Web Page Structure |
title_short |
Automatic Identification of Data Blocks based on Web Page Structure |
title_full |
Automatic Identification of Data Blocks based on Web Page Structure |
title_fullStr |
Automatic Identification of Data Blocks based on Web Page Structure |
title_full_unstemmed |
Automatic Identification of Data Blocks based on Web Page Structure |
title_sort |
automatic identification of data blocks based on web page structure |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/34610846490727929812 |
work_keys_str_mv |
AT yichenliso automaticidentificationofdatablocksbasedonwebpagestructure AT liàoyìchén automaticidentificationofdatablocksbasedonwebpagestructure AT yichenliso zhíjīyúwǎngyèjiégòudezīliàoqūkuàihuàzìdòngfēnlèi AT liàoyìchén zhíjīyúwǎngyèjiégòudezīliàoqūkuàihuàzìdòngfēnlèi |
_version_ |
1717740274200870912 |