General Strategy for Querying Web Sources in a Data Federation Environment

Modern database management systems are supporting the inclusion and querying of non-relational sources within a data federation environment via wrappers. Wrapper development for Web sources, however, is a convolution of code with extraction and query planning knowledge and becomes a daunting task. W...

Full description

Bibliographic Details
Main Authors: Firat, Aykut (Author), Wu, Lynn (Contributor), Madnick, Stuart E. (Contributor)
Other Authors: Sloan School of Management (Contributor)
Format: Article
Language:English
Published: IGI Global, 2011-12-01T18:38:46Z.
Subjects:
Online Access:Get fulltext
LEADER 01542 am a22002173u 4500
001 67341
042 |a dc 
100 1 0 |a Firat, Aykut  |e author 
100 1 0 |a Sloan School of Management  |e contributor 
100 1 0 |a Madnick, Stuart E.  |e contributor 
100 1 0 |a Wu, Lynn  |e contributor 
100 1 0 |a Madnick, Stuart E.  |e contributor 
700 1 0 |a Wu, Lynn  |e author 
700 1 0 |a Madnick, Stuart E.  |e author 
245 0 0 |a General Strategy for Querying Web Sources in a Data Federation Environment 
260 |b IGI Global,   |c 2011-12-01T18:38:46Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/67341 
520 |a Modern database management systems are supporting the inclusion and querying of non-relational sources within a data federation environment via wrappers. Wrapper development for Web sources, however, is a convolution of code with extraction and query planning knowledge and becomes a daunting task. We use IBM DB2 federation engine to demonstrate the challenges of incorporating Web sources into a data federation. We, then, present a practical and general strategy for the inclusion and querying of Web sources without requiring any changes in the underlying data federation technology. This strategy separates the code and knowledge in wrapper development by introducing a general-purpose capabilities-aware mini query-planner and a data extraction engine. As a result, Web sources can be included in a data federation system faster, and maintained easier. 
546 |a en_US 
655 7 |a Article 
773 |t Journal of Database Management