Automatic Identification of Data Blocks based on Web Page Structure

碩士 === 淡江大學 === 資訊工程學系碩士在職專班 === 98 === The internet has been a major source of information. It has taken the place of paper and become the most popular medium, such as: News web sites. Therefore, developing an automatic data collection technology is very important. At present the Really Simple Syn...

Full description

Bibliographic Details
Main Authors: Yi-Chen Liso, 廖益辰
Other Authors: Yih-Jia Tsai
Format: Others
Language:zh-TW
Published: 2010
Online Access:http://ndltd.ncl.edu.tw/handle/34610846490727929812
id ndltd-TW-098TKU05392006
record_format oai_dc
spelling ndltd-TW-098TKU053920062015-10-13T13:40:01Z http://ndltd.ncl.edu.tw/handle/34610846490727929812 Automatic Identification of Data Blocks based on Web Page Structure 植基於網頁結構的資料區塊化自動分類 Yi-Chen Liso 廖益辰 碩士 淡江大學 資訊工程學系碩士在職專班 98 The internet has been a major source of information. It has taken the place of paper and become the most popular medium, such as: News web sites. Therefore, developing an automatic data collection technology is very important. At present the Really Simple Syndication (RSS) is a general of data collection method for the users. Besides, it is use the specific program analysis web page structures to obtain the web page information. When the web page changed, the program must be rewritten. Therefore, this paper provides an automated analysis web page structure method. Using this method find the web page pattern and approved it can be the rule. It has been tested in automatic collection of web page data. Yih-Jia Tsai 蔡憶佳 2010 學位論文 ; thesis 55 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 淡江大學 === 資訊工程學系碩士在職專班 === 98 === The internet has been a major source of information. It has taken the place of paper and become the most popular medium, such as: News web sites. Therefore, developing an automatic data collection technology is very important. At present the Really Simple Syndication (RSS) is a general of data collection method for the users. Besides, it is use the specific program analysis web page structures to obtain the web page information. When the web page changed, the program must be rewritten. Therefore, this paper provides an automated analysis web page structure method. Using this method find the web page pattern and approved it can be the rule. It has been tested in automatic collection of web page data.
author2 Yih-Jia Tsai
author_facet Yih-Jia Tsai
Yi-Chen Liso
廖益辰
author Yi-Chen Liso
廖益辰
spellingShingle Yi-Chen Liso
廖益辰
Automatic Identification of Data Blocks based on Web Page Structure
author_sort Yi-Chen Liso
title Automatic Identification of Data Blocks based on Web Page Structure
title_short Automatic Identification of Data Blocks based on Web Page Structure
title_full Automatic Identification of Data Blocks based on Web Page Structure
title_fullStr Automatic Identification of Data Blocks based on Web Page Structure
title_full_unstemmed Automatic Identification of Data Blocks based on Web Page Structure
title_sort automatic identification of data blocks based on web page structure
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/34610846490727929812
work_keys_str_mv AT yichenliso automaticidentificationofdatablocksbasedonwebpagestructure
AT liàoyìchén automaticidentificationofdatablocksbasedonwebpagestructure
AT yichenliso zhíjīyúwǎngyèjiégòudezīliàoqūkuàihuàzìdòngfēnlèi
AT liàoyìchén zhíjīyúwǎngyèjiégòudezīliàoqūkuàihuàzìdòngfēnlèi
_version_ 1717740274200870912