Deep Web Search Interface Identification: A Semi-Supervised Ensemble Approach

To surface the Deep Web, one crucial task is to predict whether a given web page has a search interface (searchable HyperText Markup Language (HTML) form) or not. Previous studies have focused on supervised classification with labeled examples. However, labeled data are scarce, hard to get and requi...


Bibliographic Details
Main Authors: Hong Wang, Qingsong Xu, Lifeng Zhou
Format: Article
Language: English
Published: MDPI AG 2014-12-01
Series: Information
Online Access: http://www.mdpi.com/2078-2489/5/4/634