An empirical study on the evaluation of the RDF storage systems
Abstract In this paper, we introduce three new implementations of non-native methods for storing RDF data. These methods named RDFSPO, RDFPC and RDFVP, are based respectively on the statement table, property table and vertical partitioning approaches. As important, we consider the issue of how to se...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2021-07-01
|
Series: | Journal of Big Data |
Subjects: | |
Online Access: | https://doi.org/10.1186/s40537-021-00486-y |
id |
doaj-277775acb1ac40b180b10c425016e935 |
---|---|
record_format |
Article |
spelling |
doaj-277775acb1ac40b180b10c425016e9352021-07-11T11:03:29ZengSpringerOpenJournal of Big Data2196-11152021-07-018112010.1186/s40537-021-00486-yAn empirical study on the evaluation of the RDF storage systemsBilal Ben Mahria0Ilham Chaker1Azeddine Zahi2Sidi Mohamed Ben Abdellah UniversitySidi Mohamed Ben Abdellah UniversitySidi Mohamed Ben Abdellah UniversityAbstract In this paper, we introduce three new implementations of non-native methods for storing RDF data. These methods named RDFSPO, RDFPC and RDFVP, are based respectively on the statement table, property table and vertical partitioning approaches. As important, we consider the issue of how to select the most relevant strategy for storing the RDF data depending on the dataset characteristics. For this, we investigate the balancing between two performance metrics, including load time and query response time. In this context, we provide an empirical comparative study between on one hand the three proposed methods, and on the other hand the proposed methods versus the existing ones by using various publicly available datasets. Finally, in order to further assess where the statistically significant differences appear between studied methods, we have performed a statistical analysis, based on the non-parametric Friedman test followed by a Nemenyi post-hoc test. The obtained results clearly show that the proposed RDFVP method achieves highly competitive computational performance against other state-of-the-art methods in terms of load time and query response time.https://doi.org/10.1186/s40537-021-00486-yRDF dataNon-native methodsStatement tableProperty tableVertical portioningFriedman test |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Bilal Ben Mahria Ilham Chaker Azeddine Zahi |
spellingShingle |
Bilal Ben Mahria Ilham Chaker Azeddine Zahi An empirical study on the evaluation of the RDF storage systems Journal of Big Data RDF data Non-native methods Statement table Property table Vertical portioning Friedman test |
author_facet |
Bilal Ben Mahria Ilham Chaker Azeddine Zahi |
author_sort |
Bilal Ben Mahria |
title |
An empirical study on the evaluation of the RDF storage systems |
title_short |
An empirical study on the evaluation of the RDF storage systems |
title_full |
An empirical study on the evaluation of the RDF storage systems |
title_fullStr |
An empirical study on the evaluation of the RDF storage systems |
title_full_unstemmed |
An empirical study on the evaluation of the RDF storage systems |
title_sort |
empirical study on the evaluation of the rdf storage systems |
publisher |
SpringerOpen |
series |
Journal of Big Data |
issn |
2196-1115 |
publishDate |
2021-07-01 |
description |
Abstract In this paper, we introduce three new implementations of non-native methods for storing RDF data. These methods named RDFSPO, RDFPC and RDFVP, are based respectively on the statement table, property table and vertical partitioning approaches. As important, we consider the issue of how to select the most relevant strategy for storing the RDF data depending on the dataset characteristics. For this, we investigate the balancing between two performance metrics, including load time and query response time. In this context, we provide an empirical comparative study between on one hand the three proposed methods, and on the other hand the proposed methods versus the existing ones by using various publicly available datasets. Finally, in order to further assess where the statistically significant differences appear between studied methods, we have performed a statistical analysis, based on the non-parametric Friedman test followed by a Nemenyi post-hoc test. The obtained results clearly show that the proposed RDFVP method achieves highly competitive computational performance against other state-of-the-art methods in terms of load time and query response time. |
topic |
RDF data Non-native methods Statement table Property table Vertical portioning Friedman test |
url |
https://doi.org/10.1186/s40537-021-00486-y |
work_keys_str_mv |
AT bilalbenmahria anempiricalstudyontheevaluationoftherdfstoragesystems AT ilhamchaker anempiricalstudyontheevaluationoftherdfstoragesystems AT azeddinezahi anempiricalstudyontheevaluationoftherdfstoragesystems AT bilalbenmahria empiricalstudyontheevaluationoftherdfstoragesystems AT ilhamchaker empiricalstudyontheevaluationoftherdfstoragesystems AT azeddinezahi empiricalstudyontheevaluationoftherdfstoragesystems |
_version_ |
1721309425600299008 |