Workload-Aware Views Materialization for Big Open Linked Data

It is a trend for the public organizations to digitalize and publish their large dataset as open linked data to the public users for queries and other applications for further utilizations. Different users’ queries with various frequencies over time create different workload patterns to the servers...

Full description

Bibliographic Details
Main Authors: Tomasz J. Zlamaniec, Kuo-Ming Chao, Nick Godwin
Format: Article
Language:English
Published: World Scientific Publishing 2021-05-01
Series:Vietnam Journal of Computer Science
Subjects:
qos
Online Access:http://www.worldscientific.com/doi/epdf/10.1142/S2196888821500093
id doaj-a467774f399146668222f99eec8daa01
record_format Article
spelling doaj-a467774f399146668222f99eec8daa012021-02-01T10:41:54ZengWorld Scientific PublishingVietnam Journal of Computer Science2196-88882196-88962021-05-018221524410.1142/S219688882150009310.1142/S2196888821500093Workload-Aware Views Materialization for Big Open Linked DataTomasz J. Zlamaniec0Kuo-Ming Chao1Nick Godwin2OSP Control Systems, Ocado Technology, AL10 9NE, UKFTC, Coventry University, Coventry, CV1 2JH, UKFTC, Coventry University, Coventry, CV1 2JH, UKIt is a trend for the public organizations to digitalize and publish their large dataset as open linked data to the public users for queries and other applications for further utilizations. Different users’ queries with various frequencies over time create different workload patterns to the servers which cannot guarantee the QoS during peak usages. Materialization is a well-known effective method to reduce peaks, but it is not used by semantic webs, due to frequently evolving schema. This research is able to estimate workloads based on previous queries, analyze and normalize their structures to materialize views, and map the queries to the views with populated data. By analyzing how access patterns of individual views contribute to the overall system workload, the proposed model aims at selection of candidates offering the highest reduction of the peak workload. Consequently, rather than optimizing all queries equally, a system using the new selection method can offer higher query throughput when it is the most needed, allowing for a higher number of concurrent users without compromising QoS during the peak usage. Finally, two case studies were used to evaluate the proposed method.http://www.worldscientific.com/doi/epdf/10.1142/S2196888821500093semantic webqosview materializationoptimization strength
collection DOAJ
language English
format Article
sources DOAJ
author Tomasz J. Zlamaniec
Kuo-Ming Chao
Nick Godwin
spellingShingle Tomasz J. Zlamaniec
Kuo-Ming Chao
Nick Godwin
Workload-Aware Views Materialization for Big Open Linked Data
Vietnam Journal of Computer Science
semantic web
qos
view materialization
optimization strength
author_facet Tomasz J. Zlamaniec
Kuo-Ming Chao
Nick Godwin
author_sort Tomasz J. Zlamaniec
title Workload-Aware Views Materialization for Big Open Linked Data
title_short Workload-Aware Views Materialization for Big Open Linked Data
title_full Workload-Aware Views Materialization for Big Open Linked Data
title_fullStr Workload-Aware Views Materialization for Big Open Linked Data
title_full_unstemmed Workload-Aware Views Materialization for Big Open Linked Data
title_sort workload-aware views materialization for big open linked data
publisher World Scientific Publishing
series Vietnam Journal of Computer Science
issn 2196-8888
2196-8896
publishDate 2021-05-01
description It is a trend for the public organizations to digitalize and publish their large dataset as open linked data to the public users for queries and other applications for further utilizations. Different users’ queries with various frequencies over time create different workload patterns to the servers which cannot guarantee the QoS during peak usages. Materialization is a well-known effective method to reduce peaks, but it is not used by semantic webs, due to frequently evolving schema. This research is able to estimate workloads based on previous queries, analyze and normalize their structures to materialize views, and map the queries to the views with populated data. By analyzing how access patterns of individual views contribute to the overall system workload, the proposed model aims at selection of candidates offering the highest reduction of the peak workload. Consequently, rather than optimizing all queries equally, a system using the new selection method can offer higher query throughput when it is the most needed, allowing for a higher number of concurrent users without compromising QoS during the peak usage. Finally, two case studies were used to evaluate the proposed method.
topic semantic web
qos
view materialization
optimization strength
url http://www.worldscientific.com/doi/epdf/10.1142/S2196888821500093
work_keys_str_mv AT tomaszjzlamaniec workloadawareviewsmaterializationforbigopenlinkeddata
AT kuomingchao workloadawareviewsmaterializationforbigopenlinkeddata
AT nickgodwin workloadawareviewsmaterializationforbigopenlinkeddata
_version_ 1724315524738842624