Data extraction from the Web using XML.
This thesis presents a mechanism based on eXtensible Markup Language (XML) to extract data from HTML-based Web pages and populate relational databases. This task is performed by a system called the XML-based Web Agent (XWA). The data extraction is done in three phases. First, the Web pages are conve...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Published: |
University of Ottawa (Canada)
2009
|
Subjects: | |
Online Access: | http://hdl.handle.net/10393/9260 http://dx.doi.org/10.20381/ruor-7721 |