TY - GEN
T1 - Recursive queries on trees and data trees
AU - Abiteboul, Serge
AU - Bourhis, Pierre
AU - Muscholl, Anca
AU - Wu, Zhilin
PY - 2013/4/4
Y1 - 2013/4/4
N2 - The analysis of datalog programs over relational structures has been studied in depth, most notably the problem of containment. The analysis problems that have been considered were shown to be undecidable with the exception of (i) containment of arbitrary programs in nonrecursive ones, (ii) containment of monadic programs, and (iii) emptiness. In this paper, we are concerned with a much less studied problem, the analysis of datalog programs over data trees. We show that the analysis of datalog programs is more complex for data trees than for arbitrary structures. In particular, we prove that the three aforementioned problems are undecidable for data trees. But in practice, data trees (e.g., XML trees) are often of bounded depth. We prove that all three problems are decidable over bounded depth data trees. Another contribution of the paper is the study of a new form of automata called pattern automata, that are essentially equivalent to linear datalog programs. We use pattern automata to show that the emptiness problem for linear monadic datalog programs with data value inequalities is decidable over arbitrary data trees.
AB - The analysis of datalog programs over relational structures has been studied in depth, most notably the problem of containment. The analysis problems that have been considered were shown to be undecidable with the exception of (i) containment of arbitrary programs in nonrecursive ones, (ii) containment of monadic programs, and (iii) emptiness. In this paper, we are concerned with a much less studied problem, the analysis of datalog programs over data trees. We show that the analysis of datalog programs is more complex for data trees than for arbitrary structures. In particular, we prove that the three aforementioned problems are undecidable for data trees. But in practice, data trees (e.g., XML trees) are often of bounded depth. We prove that all three problems are decidable over bounded depth data trees. Another contribution of the paper is the study of a new form of automata called pattern automata, that are essentially equivalent to linear datalog programs. We use pattern automata to show that the emptiness problem for linear monadic datalog programs with data value inequalities is decidable over arbitrary data trees.
U2 - 10.1145/2448496.2448509
DO - 10.1145/2448496.2448509
M3 - Conference contribution
AN - SCOPUS:84875592918
SN - 9781450315982
T3 - ACM International Conference Proceeding Series
SP - 93
EP - 104
BT - ICDT 2013 - 16th International Conference on Database Theory, Proceedings
T2 - 16th International Conference on Database Theory, ICDT 2013
Y2 - 18 March 2013 through 22 March 2013
ER -