Revisiting RDF storage layouts for efficient query answering

  • M. Buron
  • , F. Goasdoué
  • , I. Manolescu
  • , T. Merabti
  • , M. L. Mugnier

Research output: Contribution to journalConference articlepeer-review

Abstract

The performance of query answering in an RDF database strongly depends on the data layout, that is, the way data is split in persistent data structures. We consider answering Basic Graph Pattern Queries (BGPQs), and in particular those with variables (also) in class and property positions, in the presence of RDFS ontologies, both through data saturation and query reformulation. We show that such demanding queries often lead to inefficient query answering on two popular storage layouts, so-called T and CP. We present novel query answering algorithms on the TCP layout, which combines T and CP. In exchange to occupying more storage space, e.g. on an inexpensive disk, TCP avoids the bad or even catastrophic performance that T and/or CP sometimes exhibit. We also introduce summary-based pruning, a novel technique based on existing RDF quotient summaries, which improves query answering performance on the T, CP and the more robust TCP layouts.

Original languageEnglish
Pages (from-to)17-32
Number of pages16
JournalCEUR Workshop Proceedings
Volume2757
Publication statusPublished - 1 Jan 2020
Event12th International Workshop on Scalable Semantic Web Knowledge Base Systems, SSWS 2020 - Athens, Greece
Duration: 2 Nov 2020 → …

Fingerprint

Dive into the research topics of 'Revisiting RDF storage layouts for efficient query answering'. Together they form a unique fingerprint.

Cite this