TY - GEN
T1 - Optimizing reformulation-based query answering in RDF
AU - Bursztyn, Damian
AU - Goasdoué, François
AU - Manolescu, Ioana
N1 - Publisher Copyright: © 2015, Copyright is with the authors.
PY - 2015/1/1
Y1 - 2015/1/1
N2 - Reformulation-based query answering is a query processing technique for answering queries under constraints. It consists of reformulating the query based on the constraints, so that evaluating the reformulated query directly against the data (i.e., without considering the constraints any further) produces the correct answer set. In this paper, we consider optimizing reformulation-based query answering in the setting of ontology-based data access, where SPARQL conjunctive queries are posed against RDF facts on which constraints expressed by an RDF Schema hold. The literature provides query reformulation algorithms for many fragments of RDF. However, reformulated queries may be complex and thus may not be efficiently processed by a query engine; well-established query engines even fail to process them in some cases. Our contributions are (i) a generalization of prior query reformulation languages, which leads us to investigate a space of reformulated queries we call JUCQs (joins of unions of conjunctive queries) instead of a single reformulation; and (ii) an effective and efficient cost-based algorithm for selecting, from this space, the reformulated query with the lowest estimated cost. Our experiments show that our technique enables reformulation-based query answering where state-of-the-art approaches are simply infeasible, and may decrease its cost by orders of magnitude in other cases.
U2 - 10.5441/002/edbt.2015.24
DO - 10.5441/002/edbt.2015.24
M3 - Conference contribution
AN - SCOPUS:84951192962
T3 - EDBT 2015 - 18th International Conference on Extending Database Technology, Proceedings
SP - 265
EP - 276
BT - EDBT 2015 - 18th International Conference on Extending Database Technology, Proceedings
A2 - Popa, Lucian
A2 - Alonso, Gustavo
A2 - Van den Bussche, Jan
A2 - Barcelo, Pablo
A2 - Teubner, Jens
A2 - Paredaens, Jan
A2 - Ugarte, Martin
A2 - Geerts, Floris
PB - OpenProceedings.org, University of Konstanz, University Library
T2 - 18th International Conference on Extending Database Technology, EDBT 2015
Y2 - 23 March 2015 through 27 March 2015
ER -