Publication

The finite implication problem for expressive XML keys: Foundations, applications, and performance evaluation

Ferrarotti, Flavio
Hartmann, Sven
Link, Sebastian
Marin, Mauricio
Muñoz, Emir
Citation
Ferrarotti, Flavio, Hartmann, Sven, Link, Sebastian, Marin, Mauricio, & Muñoz, Emir. (2013). The Finite Implication Problem for Expressive XML Keys: Foundations, Applications, and Performance Evaluation. In A. Hameurlain, J. Küng, R. Wagner, S. W. Liddle, K.-D. Schewe & X. Zhou (Eds.), Transactions on Large-Scale Data- and Knowledge-Centered Systems X: Special Issue on Database- and Expert-Systems Applications (pp. 60-94). Berlin, Heidelberg: Springer Berlin Heidelberg.
Abstract
The increasing popularity of XML for persistent data storage, processing and exchange has triggered the demand for efficient algorithms to manage XML data. Both industry and academia have long since recognized the importance of keys in XML data management. In this paper we make a theoretical as well as a practical contribution to this area. This endeavour is ambitious given the multitude of intractability results that have been established. Our theoretical contribution is based in the definition of a new fragment of XML keys that keeps the right balance between expressiveness and efficiency of maintenance. More precisely, we characterize the associated implication problem axiomatically and develop a low-degree polynomial time decision algorithm. In comparison to previous work, this new fragment of XML keys provides designers with an enhanced ability to capture properties of XML data that are significant for the application at hand. Our practical contribution includes an efficient implementation of this decision algorithm and a thorough evaluation of its performance, demonstrating that reasoning about expressive notions of XML keys can be done efficiently in practice, and scales well. Our results promote the use of XML keys on real-world XML practice, where a little more semantics makes applications a lot more effective. To exemplify this potential, we use the decision algorithm to calculate non-redundant covers for sets of XML keys. In turn, this allow us to reduce significantly the time required to validate large XML documents against keys from the proposed fragment.
Funder
Publisher
Springer
Publisher DOI
Rights
Attribution-NonCommercial-NoDerivs 3.0 Ireland