Data Extraction using Content-Based Handles
In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. In our extraction algorithm, we inspired...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Shahrood University of Technology
2018-07-01
|
Series: | Journal of Artificial Intelligence and Data Mining |
Subjects: | |
Online Access: | http://jad.shahroodut.ac.ir/article_990_3d26710f01637300b6a97ff0bf1441ac.pdf |