An empirical study on the evaluation of the RDF storage systems

Abstract In this paper, we introduce three new implementations of non-native methods for storing RDF data. These methods named RDFSPO, RDFPC and RDFVP, are based respectively on the statement table, property table and vertical partitioning approaches. As important, we consider the issue of how to se...

Full description

Bibliographic Details
Main Authors: Bilal Ben Mahria, Ilham Chaker, Azeddine Zahi
Format: Article
Language:English
Published: SpringerOpen 2021-07-01
Series:Journal of Big Data
Subjects:
Online Access:https://doi.org/10.1186/s40537-021-00486-y
id doaj-277775acb1ac40b180b10c425016e935
record_format Article
spelling doaj-277775acb1ac40b180b10c425016e9352021-07-11T11:03:29ZengSpringerOpenJournal of Big Data2196-11152021-07-018112010.1186/s40537-021-00486-yAn empirical study on the evaluation of the RDF storage systemsBilal Ben Mahria0Ilham Chaker1Azeddine Zahi2Sidi Mohamed Ben Abdellah UniversitySidi Mohamed Ben Abdellah UniversitySidi Mohamed Ben Abdellah UniversityAbstract In this paper, we introduce three new implementations of non-native methods for storing RDF data. These methods named RDFSPO, RDFPC and RDFVP, are based respectively on the statement table, property table and vertical partitioning approaches. As important, we consider the issue of how to select the most relevant strategy for storing the RDF data depending on the dataset characteristics. For this, we investigate the balancing between two performance metrics, including load time and query response time. In this context, we provide an empirical comparative study between on one hand the three proposed methods, and on the other hand the proposed methods versus the existing ones by using various publicly available datasets. Finally, in order to further assess where the statistically significant differences appear between studied methods, we have performed a statistical analysis, based on the non-parametric Friedman test followed by a Nemenyi post-hoc test. The obtained results clearly show that the proposed RDFVP method achieves highly competitive computational performance against other state-of-the-art methods in terms of load time and query response time.https://doi.org/10.1186/s40537-021-00486-yRDF dataNon-native methodsStatement tableProperty tableVertical portioningFriedman test
collection DOAJ
language English
format Article
sources DOAJ
author Bilal Ben Mahria
Ilham Chaker
Azeddine Zahi
spellingShingle Bilal Ben Mahria
Ilham Chaker
Azeddine Zahi
An empirical study on the evaluation of the RDF storage systems
Journal of Big Data
RDF data
Non-native methods
Statement table
Property table
Vertical portioning
Friedman test
author_facet Bilal Ben Mahria
Ilham Chaker
Azeddine Zahi
author_sort Bilal Ben Mahria
title An empirical study on the evaluation of the RDF storage systems
title_short An empirical study on the evaluation of the RDF storage systems
title_full An empirical study on the evaluation of the RDF storage systems
title_fullStr An empirical study on the evaluation of the RDF storage systems
title_full_unstemmed An empirical study on the evaluation of the RDF storage systems
title_sort empirical study on the evaluation of the rdf storage systems
publisher SpringerOpen
series Journal of Big Data
issn 2196-1115
publishDate 2021-07-01
description Abstract In this paper, we introduce three new implementations of non-native methods for storing RDF data. These methods named RDFSPO, RDFPC and RDFVP, are based respectively on the statement table, property table and vertical partitioning approaches. As important, we consider the issue of how to select the most relevant strategy for storing the RDF data depending on the dataset characteristics. For this, we investigate the balancing between two performance metrics, including load time and query response time. In this context, we provide an empirical comparative study between on one hand the three proposed methods, and on the other hand the proposed methods versus the existing ones by using various publicly available datasets. Finally, in order to further assess where the statistically significant differences appear between studied methods, we have performed a statistical analysis, based on the non-parametric Friedman test followed by a Nemenyi post-hoc test. The obtained results clearly show that the proposed RDFVP method achieves highly competitive computational performance against other state-of-the-art methods in terms of load time and query response time.
topic RDF data
Non-native methods
Statement table
Property table
Vertical portioning
Friedman test
url https://doi.org/10.1186/s40537-021-00486-y
work_keys_str_mv AT bilalbenmahria anempiricalstudyontheevaluationoftherdfstoragesystems
AT ilhamchaker anempiricalstudyontheevaluationoftherdfstoragesystems
AT azeddinezahi anempiricalstudyontheevaluationoftherdfstoragesystems
AT bilalbenmahria empiricalstudyontheevaluationoftherdfstoragesystems
AT ilhamchaker empiricalstudyontheevaluationoftherdfstoragesystems
AT azeddinezahi empiricalstudyontheevaluationoftherdfstoragesystems
_version_ 1721309425600299008