Template-based Information Extraction from Tree-structured HTML Documents
Master's === National Taiwan University === Graduate Institute of Computer Science and Information Engineering === 85 === This thesis proposes a novel approach to information extraction that identifies structural components in online web documents. A brief description of the approach follows....
Main Authors: | Yih, Wen-tau, 易文韜 |
---|---|
Other Authors: | Jane Yung-jen Hsu |
Format: | Others |
Language: | zh-TW |
Published: | 1997 |
Online Access: | http://ndltd.ncl.edu.tw/handle/29386387918552988914 |
Similar Items
- Automatic Generation of Tree-Structured Templates for Information Extraction from HTML Documents
  by: Shui-lung Chuang, et al.
  Published: (1999)
- Implementation and Application of Approximate Tree Matching for Information Extraction from HTML Documents
  by: Liu, Ching-hung, et al.
  Published: (1998)
- Automated extraction of structured data from HTML documents
  by: Stachowiak, Maciej, 1976-
  Published: (2005)
- An Information Extraction Method for HTML Documents and its Applications
  by: Pan, Jia-Yu, et al.
  Published: (1997)
- Comparing machine learning and hand-crafted approaches for information extraction from HTML documents
  by: Singer, Ron
  Published: (2003)