Data extraction from the Web using XML.

This thesis presents a mechanism based on eXtensible Markup Language (XML) to extract data from HTML-based Web pages and populate relational databases. This task is performed by a system called the XML-based Web Agent (XWA). The data extraction is done in three phases. First, the Web pages are conve...

Full description

Bibliographic Details
Main Author: Ouahid, Hicham.
Other Authors: Karmouch, Ahmed
Format: Others
Published: University of Ottawa (Canada) 2009
Subjects:
Online Access:http://hdl.handle.net/10393/9260
http://dx.doi.org/10.20381/ruor-7721