Appreciation of structured and unstructured content to aid decision making : from web scraping to ontologies and data dictionaries in healthcare
A systematic approach to the extraction of data from disparate data sources is proposed. The World Wide Web is a most diverse dataset; identifying ways in which this large database provides means for data quality verification with concepts such as data lineage and provenance allows to follow the sam...
Main Author: | |
---|---|
Other Authors: | |
Published: | University of Surrey, 2016 |
Subjects: | |
Online Access: | http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.698632 |