Mining cardinalities from knowledge bases
Muñoz, Emir ; Nickles, Matthias
Muñoz, Emir
Nickles, Matthias
Loading...
Repository DOI
Publication Date
2017-08-01
Type
Conference Paper
Downloads
Citation
Muñoz, Emir, & Nickles, Matthias. (2017). Mining Cardinalities from Knowledge Bases. In Djamal Benslimane, Ernesto Damiani, William I. Grosky, Abdelkader Hameurlain, Amit Sheth & Roland R. Wagner (Eds.), Database and Expert Systems Applications: 28th International Conference, DEXA 2017, Lyon, France, August 28-31, 2017, Proceedings, Part I (pp. 447-462). Cham: Springer International Publishing.
Abstract
Cardinality is an important structural aspect of data that has not received enough attention in the context of RDF knowledge bases (KBs). Information about cardinalities can be useful for data users and knowledge engineers when writing queries, reusing or engineering KBs. Such cardinalities can be declared using OWL and RDF constraint languages as constraints on the usage of properties over instance data. However, their declaration is optional and consistency with the instance data is not ensured. In this paper, we address the problem of mining cardinality bounds for properties to discover structural characteristics of KBs, and use these bounds to assess completeness. Because KBs are incomplete and error-prone, we apply statistical methods for filtering property usage and for finding accurate and robust patterns. Accuracy of the cardinality patterns is ensured by properly handling equality axioms (owl:sameAs); and robustness by filtering outliers. We report an implementation of our algorithm with two variants using SPARQL 1.1 and Apache Spark, and their evaluation on real-world and synthetic data.
Funder
Publisher
Springer Verlag
Publisher DOI
10.1007/978-3-319-64468-4_34
Rights
Attribution-NonCommercial-NoDerivs 3.0 Ireland