Appreciation of structured and unstructured content to aid decision making : from web scraping to ontologies and data dictionaries in healthcare

A systematic approach to the extraction of data from disparate data sources is proposed. The World Wide Web is a most diverse dataset; identifying ways in which this large database provides means for data quality verification with concepts such as data lineage and provenance allows to follow the sam...

Full description

Bibliographic Details
Main Author: Michalakidis, Georgios
Other Authors: Krause, Paul J.
Published: University of Surrey 2016
Subjects:
Online Access:http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.698632