Ontologies
Equality-Friendly Well-Founded Semantics and Applications to Description Logics
Gottlob, Georg (University of Oxford) | Hernich, André (Humboldt-Universitaet zu Berlin) | Kupke, Clemens (University of Oxford) | Lukasiewicz, Thomas (University of Oxford)
We tackle the problem of defining a well-founded semantics for Datalog rules with existentially quantified variables in their heads and negations in their bodies. In particular, we provide a well-founded semantics (WFS) for the recent Datalog+/- family of ontology languages, which covers several important description logics (DLs). To do so, we generalize Datalog+/- by non-stratified nonmonotonic negation in rule bodies, and we define a WFS for this generalization via guarded fixed-point logic. We refer to this approach as equality-friendly WFS, since it has the advantage that it does not make the unique name assumption (UNA); this brings it close to OWL and its profiles as well as typical DLs, which also do not make the UNA. We prove that for guarded Datalog+/- with negation under the equality-friendly WFS, conjunctive query answering is decidable, and we provide precise complexity results for this problem. From these results, we obtain precise definitions of the standard WFS extensions of EL and of members of the DL-Lite family, as well as corresponding complexity results for query answering.
Benchmarking Ontology-Based Query Rewriting Systems
Imprialou, Martha (University of Oxford) | Stoilos, Giorgos (National Technical University of Athens) | Grau, Bernardo Cuenca (University of Oxford)
Query rewriting is a prominent reasoning technique in ontology-based data access applications. A wide variety of query rewriting algorithms have been proposed in recent years and implemented in highly optimised reasoning systems. Query rewriting systems are complex software programs; even if based on provably correct algorithms, sophisticated optimisations make the systems more complex and errors become more likely to happen. In this paper, we present an algorithm that, given an ontology as input, synthetically generates ``relevant'' test queries. Intuitively, each of these queries can be used to verify whether the system correctly performs a certain set of ``inferences'', each of which can be traced back to axioms in the input ontology. Furthermore, we present techniques that allow us to determine whether a system is unsound and/or incomplete for a given test query and ontology. Our evaluation shows that most publicly available query rewriting systems are unsound and/or incomplete, even on commonly used benchmark ontologies; more importantly, our techniques revealed the precise causes of their correctness issues and the systems were then corrected based on our feedback. Finally, since our evaluation is based on a larger set of test queries than existing benchmarks, which are based on hand-crafted queries, it also provides a better understanding of the scalability behaviour of each system.
SPARQL Query Containment Under SHI Axioms
Chekol, Melisachew Wudage (INRIA and LIG) | Euzenat, Jérôme (INRIA and LIG) | Genevès, Pierre (CNRS) | Layaïda, Nabil (INRIA and LIG)
SPARQL query containment under schema axioms is the problem of determining whether, for any RDF graph satisfying a given set of schema axioms, the answers to a query are contained in the answers of another query. This problem has major applications for verification and optimization of queries. In order to solve it, we rely on the mu-calculus. Firstly, we provide a mapping from RDF graphs into transition systems. Secondly, SPARQL queries and RDFS and SHI axioms are encoded into mu-calculus formulas. This allows us to reduce query containment and equivalence to satisfiability in the mu-calculus. Finally, we prove a double exponential upper bound for containment under SHI schema axioms.
Query Rewriting for Horn-SHIQ Plus Rules
Eiter, Thomas (Vienna University of Technology) | Ortiz, Magdalena (Vienna University of Technology) | Simkus, Mantas (Vienna University of Technology) | Tran, Trung-Kien (Vrije Universiteit Brussel) | Xiao, Guohui (Vienna University of Technology)
Query answering over Description Logic (DL) ontologies has become a vibrant field of research. Efficient realizations often exploit database technology and rewrite a given query to an equivalent SQL or Datalog query over a database associated with the ontology. This approach has been intensively studied for conjunctive query answering in the DL-Lite and EL families, but is much less explored for more expressive DLs and queries. We present a rewriting-based algorithm for conjunctive query answering over Horn-SHIQ ontologies, possibly extended with recursive rules under limited recursion as in DL+log. This setting not only subsumes both DL-Lite and EL, but also yields an algorithm for answering (limited) recursive queries over Horn-SHIQ ontologies (an undecidable problem for full recursive queries). A prototype implementation shows its potential for applications, as experiments exhibit efficient query answering over full Horn-SHIQ ontologies and benign downscaling to DL-Lite, where it is competitive with comparable state of the art systems.
REWOrD: Semantic Relatedness in the Web of Data
Pirró, Giuseppe (Free University of Bolzano-Bozen)
This paper presents REWOrD, an approach to compute semantic relatedness between entities in the Web of Data representing real word concepts. REWOrD exploits the graph nature of RDF data and the SPARQL query language to access this data. Through simple queries, REWOrD constructs weighted vectors keeping the informativeness of RDF predicates used to make statements about the entities being compared. The most informative path is also considered to further refine informativeness. Relatedness is then computed by the cosine of the weighted vectors. Differently from previous approaches based on Wikipedia, REWOrD does not require any prepro- cessing or custom data transformation. Indeed, it can lever- age whatever RDF knowledge base as a source of background knowledge. We evaluated REWOrD in different settings by using a new dataset of real word entities and investigate its flexibility. As compared to related work on classical datasets, REWOrD obtains comparable results while, on one side, it avoids the burden of preprocessing and data transformation and, on the other side, it provides more flexibility and applicability in a broad range of domains.
Ontology-Based Data Access with Dynamic TBoxes in DL-Lite
Pinto, Floriana Di (Sapienza University of Rome) | Giacomo, Giuseppe De (Sapienza University of Rome) | Lenzerini, Maurizio (Sapienza University of Rome) | Rosati, Riccardo (Sapienza University of Rome)
In this paper we introduce the notion of mapping-based knowledge base (MKB) to formalize the situation where both the extensional and the intensional level of the ontology are determined by suitable mappings to a set of (relational) data sources. This allows for making the intensional level of the ontology as dynamic as traditionally the extensional level is. To do so, we resort to the meta-modeling capabilities of higher-order Description Logics, which allow us to see concepts and roles as individuals, and vice versa. The challenge in this setting is to design tractable query answering algorithms. Besides the definition of MKBs, our main result is that answering instance queries posed to MKBs expressed in Hi(DL-LiteR) can be done efficiently. In particular, we define a query rewriting technique that produces first-order (SQL) queries to be posed to the data sources.
Usage-Centric Benchmarking of RDF Triple Stores
Morsey, Mohamed (AKSW Research Group University of Leipzig) | Lehmann, Jens (AKSW Research Group University of Leipzig) | Auer, Sören (AKSW Research Group University of Leipzig) | Ngomo, Axel-Cyrille Ngonga (AKSW Research Group University of Leipzig)
A central component in many applications is the underlying data management layer. In Data-Web applications, the central component of this layer is the triple store. It is thus evident that finding the most adequate store for the application to develop is of crucial importance for individual projects as well as for data integration on the Data Web in general. In this paper, we propose a generic benchmark creation procedure for SPARQL, which we apply to the DBpedia knowledge base. In contrast to previous approaches, our benchmark is based on queries that were actually issued by humans and applications against existing RDF data not resembling a relational schema. In addition, our approach does not only take the query string but also the features of the queries into consideration during the benchmark generation process. Our generic procedure for benchmark creation is based on query-log mining, SPARQL feature analysis and clustering. After presenting the method underlying our benchmark generation algorithm, we use the generated benchmark to compare the popular triple store implementations Virtuoso, Sesame, Jena-TDB, and BigOWLIM.
Preface
Srivastava, Biplav (IBM T.J. Watson Research Center, Hawthorne)
We will like to call cities that enable such capabilities as, "semantic cities." In a semantic city, available resources are harnessed safely, sustainably and efficiently to achieve positive, measurable economic and societal outcomes. Enabling city information as a utility, through a robust (expressive, dynamic, scalable) and (critically) a sustainable technology and socially synergistic ecosystem could drive significant benefits and opportunities. Data (and then information and knowledge) from people, systems, and things is the single most scalable resource available to city stakeholders to reach the objective of semantic cities. Two major trends are supporting semantic cities -- open data and semantic web.
Capturing the Pulse of Cities: Opportunity and Research Challenges for Robust Stream Data Reasoning
Lecue, Freddy (IBM Research, Smarter Cities Technology Centre) | Kotoulas, Spyros (IBM Research, Smarter Cities Technology Centre) | Aonghusa, Pol Mac (IBM Research, Smarter Cities Technology Centre)
In a Smarter City, available resources are harnessed safely, sustainably and efficiently to achieve positive, measurable economic and societal outcomes. Data and information from people, systems and things is the single most scalable resource available to city stakeholders but difficult to publish, organize, discover and consume, especially in a real-time context. Enabling city information as a utility, through a robust (expressive, dynamic, scalable) and (critically) a sustainable technology and socially synergistic ecosystem, could drive significant benefits and opportunities. In the context of stream data (as real-time, gigantic, noisy and private data), this paper targets research issues we identify as important to harness the fused information resources of cities, Citizens and Stakeholders to reach the concept of Smarter Cities.
Building a Timeline Network for Evacuation in Earthquake Disaster
Nguyen, The Minh (The University of Electro-Communications) | Kawamura, Takahiro (The University of Electro-Communications) | Tahara, Yasuyuki (The University of Electro-Communications) | Ohsuga, Akihiko (The University of Electro-Communications)
In this paper, we propose an approach that automatically extract users’ activities in sentences retrieved from Twitter. We then design a timeline action networkbased on Web Ontology Language (OWL). By using the proposed activity extraction approach, we can automatically collect data for the action network. Finally, we propose a novel action-based collaborative filtering, which predicts missing activity data, in order to complement this timeline network. Moreover, with a combination of collaborative filtering and natural language processing (NLP), our method can deal with minority actions such as successful actions. Based on evaluation of tweets which related to the massive Tohoku earthquake,we indicated that our timeline action network can provide useful action patterns in real-time. Not only earthquake disaster, our research can also be applied to other disasters and business models, such as typhoon,travel, marketing, etc.