Ontologies
On Expansion and Contraction of DL-Lite Knowledge Bases
Zheleznyakov, Dmitriy, Kharlamov, Evgeny, Nutt, Werner, Calvanese, Diego
Knowledge bases (KBs) are not static entities: new information constantly appears and some of the previous knowledge becomes obsolete. In order to reflect this evolution of knowledge, KBs should be expanded with the new knowledge and contracted from the obsolete one. This problem is well-studied for propositional but much less for first-order KBs. In this work we investigate knowledge expansion and contraction for KBs expressed in DL-Lite, a family of description logics (DLs) that underlie the tractable fragment OWL 2 QL of the Web Ontology Language OWL 2. We start with a novel knowledge evolution framework and natural postulates that evolution should respect, and compare our postulates to the well-established AGM postulates. We then review well-known model and formula-based approaches for expansion and contraction for propositional theories and show how they can be adapted to the case of DL-Lite. In particular, we show intrinsic limitations of model-based approaches: besides the fact that some of them do not respect the postulates we have established, they ignore the structural properties of KBs. This leads to undesired properties of evolution results: evolution of DL-Lite KBs cannot be captured in DL-Lite. Moreover, we show that well-known formula-based approaches are also not appropriate for DL-Lite expansion and contraction: they either have a high complexity of computation, or they produce logical theories that cannot be expressed in DL-Lite. Thus, we propose a novel formula-based approach that respects our principles and for which evolution is expressible in DL-Lite. For this approach we also propose polynomial time deterministic algorithms to compute evolution of DL-Lite KBs when evolution affects only factual data.
A Journey into Ontology Approximation: From Non-Horn to Hon
Haga, Anneke, Lutz, Carsten, Marti, Johannes, Wolter, Frank
We study complete approximations of an ontology formulated in a non-Horn description logic (DL) such as $\mathcal{ALC}$ in a Horn DL such as~$\mathcal{EL}$. We provide concrete approximation schemes that are necessarily infinite and observe that in the $\mathcal{ELU}$-to-$\mathcal{EL}$ case finite approximations tend to exist in practice and are guaranteed to exist when the original ontology is acyclic. In contrast, neither of this is the case for $\mathcal{ELU}_\bot$-to-$\mathcal{EL}_\bot$ and for $\mathcal{ALC}$-to-$\mathcal{EL}_\bot$ approximations. We also define a notion of approximation tailored towards ontology-mediated querying, connect it to subsumption-based approximations, and identify a case where finite approximations are guaranteed to exist.
Machine Understandable Policies and GDPR Compliance Checking
Bonatti, Piero A., Kirrane, Sabrina, Petrova, Iliana M., Sauro, Luigi
Ea ch process description is shaped like a formalized business policy consisting of the following set of features: - the file(s) to be processed; - the software that carries out the processing; - the purpose of the processing; - the entities that can access the results of the processing; - the details of where the results are stored and for how long; - the obligations that are fulfilled while (or before) carrying out the processing; - the legal basis of the processing. It is not hard to see that the first five elements in the above list match SPECIAL's usage policy language (UPL) introduced in Section 3. As far as the above elements are concerned, the only difference between UPL expressions and a business policy is the granularity of attribute values. Fo r example, the involved data (specified in the first element of the above list) are not expressed as a general, content-oriented category, but rather as a concrete set of data sourc es or data items. Such objects can be modeled as instances or subclasses of the general data categories illustrated in Section 3, thereby creating a link between digital artifacts and usage policies. Similar considerations hold for the other a t-tributes: - processing is not necessarily described in the abstract terms adopted by the processing vocabulary introduced in Section 3; in a business policy, this can be specified by naming concrete software procedures; - the purpose of data processing may be directly related to the data controller's mission and products; - recipients may consist of a concrete list of legal and/or physical persons, as opposed to general categories such as Ours or ThirdParty; - storage may be specified by a list of specific data repositories, at the level of files and hosts. With this level of granularity, specific authorizations can be derived from the business policy, for example: The indicated software procedure can read the indicated data sources. The results can be written in the specified repositories. The specified recipients can read the repositories...
Classifying Wikipedia in a fine-grained hierarchy: what graphs can contribute
Viard, Tiphaine, McLachlan, Thomas, Ghader, Hamidreza, Sekine, Satoshi
Wikipedia is a huge opportunity for machine learning, being the largest semi-structured base of knowledge available. Because of this, many works examine its contents, and focus on structuring it in order to make it usable in learning tasks, for example by classifying it into an ontology. Beyond its textual contents, Wikipedia also displays a typical graph structure, where pages are linked together through citations. In this paper, we address the task of integrating graph ( i.e. structure) information to classify Wikipedia into a fine-grained named entity ontology (NE), the Extended Named Entity hierarchy. To address this task, we first start by assessing the relevance of the graph structure for NE classification. We then explore two directions, one related to feature vectors using graph descriptors commonly used in large-scale network analysis, and one extending flat classification to a weighted model taking into account semantic similarity. We conduct at-scale practical experiments, on a manually labeled subset of 22,000 pages extracted from the Japanese Wikipedia. Our results show that integrating graph information succeeds at reducing sparsity of the input feature space, and yields classification results that are comparable or better than previous works.
A Neural Architecture for Person Ontology population
Ganesan, Balaji, Dasgupta, Riddhiman, Parekh, Akshay, Patel, Hima, Reinwald, Berthold
A person ontology comprising concepts, attributes and relationships of people has a number of applications in data protection, de-identification, population of knowledge graphs for business intelligence and fraud prevention. While artificial neural networks have led to improvements in Entity Recognition, Entity Classification, and Relation Extraction, creating an ontology largely remains a manual process, because it requires a fixed set of semantic relations between concepts. In this work, we present a system for automatically populating a person ontology graph from unstructured data using neural models for Entity Classification and Relation Extraction. We introduce a new dataset for these tasks and discuss our results. Introduction We can define Personal Data Entity (PDE) as any information about a person.
Provenance for the Description Logic ELHr
Bourgaux, Camille, Ozaki, Ana, Peñaloza, Rafael, Predoiu, Livia
We address the problem of handling provenance information in ELHr ontologies. We consider a setting recently introduced for ontology-based data access, based on semirings and extending classical data provenance, in which ontology axioms are annotated with provenance tokens. A consequence inherits the provenance of the axioms involved in deriving it, yielding a provenance polynomial as an annotation. We analyse the semantics for the ELHr case and show that the presence of conjunctions poses various difficulties for handling provenance, some of which are mitigated by assuming multiplicative idempotency of the semiring. Under this assumption, we study three problems: ontology completion with provenance, computing the set of relevant axioms for a consequence, and query answering.
A multi-agent ontologies-based clinical decision support system
Shen, Ying, Armelle, Jacquet-Andrieu, Colloc, Joël
Clinical decision support systems combine knowledge and data from a variety of sources, represented by quantitative models based on stochastic methods, or qualitative based rather on expert heuristics and deductive reasoning. At the same time, case-based reasoning (CBR) memorizes and returns the experience of solving similar problems. The cooperation of heterogeneous clinical knowledge bases (knowledge objects, semantic distances, evaluation functions, logical rules, databases...) is based on medical ontologies. A multi-agent decision support system (MADSS) enables the integration and cooperation of agents specialized in different fields of knowledge (semiology, pharmacology, clinical cases, etc.). Each specialist agent operates a knowledge base defining the conduct to be maintained in conformity with the state of the art associated with an ontological basis that expresses the semantic relationships between the terms of the domain in question. Our approach is based on the specialization of agents adapted to the knowledge models used during the clinical steps and ontologies. This modular approach is suitable for the realization of MADSS in many areas.
Negative Statements Considered Useful
Arnaout, Hiba, Razniewski, Simon, Weikum, Gerhard
Knowledge bases (KBs), pragmatic collections of knowledge about notable entities, are an important asset in applications such as search, question answering and dialogue. Rooted in a long tradition in knowledge representation, all popular KBs only store positive information, while they abstain from taking any stance towards statements not contained in them. In this paper, we make the case for explicitly stating interesting statements which are not true. Negative statements would be important to overcome current limitations of question answering, yet due to their potential abundance, any effort towards compiling them needs a tight coupling with ranking. We introduce two approaches towards compiling negative statements. (i) In peer-based statistical inferences, we compare entities with highly related entities in order to derive potential negative statements, which we then rank using supervised and unsupervised features. (ii) In query-log-based text extraction, we use a pattern-based approach for harvesting search engine query logs. Experimental results show that both approaches hold promising and complementary potential. Along with this paper, we publish the first datasets on interesting negative information, containing over 1.1M statements for 100K popular Wikidata entities.
Correcting Knowledge Base Assertions
Chen, Jiaoyan, Chen, Xi, Horrocks, Ian, Jimenez-Ruiz, Ernesto, Myklebus, Erik B.
The usefulness and usability of knowledge bases (KBs) is often limited by quality issues. One common issue is the presence of erroneous assertions, often caused by lexical or semantic confusion. We study the problem of correcting such assertions, and present a general correction framework which combines lexical matching, semantic embedding, soft constraint mining and semantic consistency checking. The framework is evaluated using DBpedia and an enterprise medical KB.
Knowledge Integration of Collaborative Product Design Using Cloud Computing Infrastructure
Bohlouli, Mahdi, Holland, Alexander, Fathi, Madjid
-- T he pivotal key for the success of manufacturing enterprises is sustainable and innovative product design and development. In collaborative design, stakehol ders are heterogeneously distributed chain - like . Due to the growing volume of data and knowledge, an effective management of the knowledge acquired in the product design and development is one of the key challenges facing most manufacturing enterprises. Opportunities for improving efficiency and performance of IT - based product design applications through centralization of resources such as knowledge and computation have increased in the last few years with maturation of technologies such as SOA, virtualization, grid computing, and /or cloud computing. The main focus of this paper is the concept of ongoing research in providing the knowledge integration service for collaborative product design and development using cloud computing infra structure . P otential s of the cloud computing to support the Knowledge integration functionalities as a Service by providing functionalities such as knowledge mapping, merging, searching, and transferring in product design procedure are described in this paper . Proposed knowledge integration services support users by giving real - time access to knowledge resources. The framework has the advantage of availability, efficiency, cost reduction, less time to result, and scalability . Changes made during the early design stage do not cause the significant increase in costs, while during the production stage, sharp increase in costs will occur since many blueprints, design documents or components would require re - work and re - design [ 5 ] . Today's research is focused on optimising the development methodologies to enable shorter time, lower costs and higher quality of the systems [ 2 ] . The pivotal key for the success of manufacturing enterprises is sustainable and innovative product design and development . In order to achieve this goal, it is required to have a real and deep knowledge of former and current procedures in the manufacturing enterprise [4] and future needs as well as customer feedback s and various stages of production cha in activities. Realization of an efficient knowledge transfer between different stakeholders of product development process such as linking customers and suppliers proactively throughout the entire value chain, and collaborating across boundaries in distri buted enterprise s is reinforcing this trend.