Ontologies
A Polynomial Time Subsumption Algorithm for Nominal Safe $\mathcal{ELO}_\bot$ under Rational Closure
Casini, Giovanni, Straccia, Umberto, Meyer, Thomas
Description Logics (DLs) under Rational Closure (RC) is a well-known framework for non-monotonic reasoning in DLs. In this paper, we address the concept subsumption decision problem under RC for nominal safe $\mathcal{ELO}_\bot$, a notable and practically important DL representative of the OWL 2 profile OWL 2 EL. Our contribution here is to define a polynomial time subsumption procedure for nominal safe $\mathcal{ELO}_\bot$ under RC that relies entirely on a series of classical, monotonic $\mathcal{EL}_\bot$ subsumption tests. Therefore, any existing classical monotonic $\mathcal{EL}_\bot$ reasoner can be used as a black box to implement our method. We then also adapt the method to one of the known extensions of RC for DLs, namely Defeasible Inheritance-based DLs without losing the computational tractability.
Generating Ontologies from Templates: A Rule-Based Approach for Capturing Regularity
Forssell, Henrik, Kindermann, Christian, Lupp, Daniel P., Sattler, Uli, Thorstensen, Evgenij
We present a second-order language that can be used to succinctly specify ontologies in a consistent and transparent manner. This language is based on ontology templates (OTTR), a framework for capturing recurring patterns of axioms in ontological modelling. The language and our results are independent of any specific DL. We define the language and its semantics, including the case of negation-as-failure, investigate reasoning over ontologies specified using our language, and show results about the decidability of useful reasoning tasks about the language itself. We also state and discuss some open problems that we believe to be of interest.
Semantically Enhanced Models for Commonsense Knowledge Acquisition
Alhussien, Ikhlas, Cambria, Erik, NengSheng, Zhang
Abstract--Commonsense knowledge is paramount to enable intelligent systems. Typically, it is characterized as being implicit and ambiguous, hindering thereby the automation of its acquisition. To address these challenges, this paper presents semantically enhanced models to enable reasoning through resolving part of commonsense ambiguity. The proposed models enhance in a knowledge graph embedding framework for knowledge base completion. Experimental results show the effectiveness of the new semantic models in commonsense reasoning. Intelligent systems need to acquire humanlike knowledge in order to perform smart decision making. This type of knowledge which is often termed commonsense knowledge refers to the agreed-upon facts and information about everyday world that is assumed to be shared by everyone.
Medical Knowledge Embedding Based on Recursive Neural Network for Multi-Disease Diagnosis
Jiang, Jingchi, Wang, Huanzheng, Xie, Jing, Guo, Xitong, Guan, Yi, Yu, Qiubin
The representation of knowledge based on first-order logic captures the richness of natural language and supports multiple probabilistic inference models. Although symbolic representation enables quantitative reasoning with statistical probability, it is difficult to utilize with machine learning models as they perform numerical operations. In contrast, knowledge embedding (i.e., high-dimensional and continuous vectors) is a feasible approach to complex reasoning that can not only retain the semantic information of knowledge but also establish the quantifiable relationship among them. In this paper, we propose recursive neural knowledge network (RNKN), which combines medical knowledge based on first-order logic with recursive neural network for multi-disease diagnosis. After RNKN is efficiently trained from manually annotated Chinese Electronic Medical Records (CEMRs), diagnosis-oriented knowledge embeddings and weight matrixes are learned. Experimental results verify that the diagnostic accuracy of RNKN is superior to that of some classical machine learning models and Markov logic network (MLN). The results also demonstrate that the more explicit the evidence extracted from CEMRs is, the better is the performance achieved. RNKN gradually exhibits the interpretation of knowledge embeddings as the number of training epochs increases.
Arianna+: Scalable Human Activity Recognition by Reasoning with a Network of Ontologies
Kareem, Syed Yusha, Buoncompagni, Luca, Mastrogiovanni, Fulvio
Aging population ratios are rising significantly. Meanwhile, smart home based health monitoring services are evolving rapidly to become a viable alternative to traditional healthcare solutions. Such services can augment qualitative analyses done by gerontologists with quantitative data. Hence, the recognition of Activities of Daily Living (ADL) has become an active domain of research in recent times. For a system to perform human activity recognition in a real-world environment, multiple requirements exist, such as scalability, robustness, ability to deal with uncertainty (e.g., missing sensor data), to operate with multi-occupants and to take into account their privacy and security. This paper attempts to address the requirements of scalability and robustness, by describing a reasoning mechanism based on modular spatial and/or temporal context models as a network of ontologies. The reasoning mechanism has been implemented in a smart home system referred to as Arianna+. The paper presents and discusses a use case, and experiments are performed on a simulated dataset, to showcase Arianna+'s modularity feature, internal working, and computational performance. Results indicate scalability and robustness for human activity recognition processes.
Probabilistic DL Reasoning with Pinpointing Formulas: A Prolog-based Approach
Zese, Riccardo, Bellodi, Elena, Cota, Giuseppe, Lamma, Evelina, Riguzzi, Fabrizio
When modeling real world domains we have to deal with information that is incomplete or that comes from sources with different trust levels. This motivates the need for managing uncertainty in the Semantic Web. To this purpose, we introduced a probabilistic semantics, named DISPONTE, in order to combine description logics with probability theory. The probability of a query can be then computed from the set of its explanations by building a Binary Decision Diagram (BDD). The set of explanations can be found using the tableau algorithm, which has to handle non-determinism. Prolog, with its efficient handling of non-determinism, is suitable for implementing the tableau algorithm. TRILL and TRILLP are systems offering a Prolog implementation of the tableau algorithm. TRILLP builds a pinpointing formula, that compactly represents the set of explanations and can be directly translated into a BDD. Both reasoners were shown to outperform state-of-the-art DL reasoners. In this paper, we present an improvement of TRILLP, named TORNADO, in which the BDD is directly built during the construction of the tableau, further speeding up the overall inference process. An experimental comparison shows the effectiveness of TORNADO. All systems can be tried online in the TRILL on SWISH web application at http://trill.ml.unife.it/.
The Space-Efficient Core of Vadalog
Berger, Gerald, Gottlob, Georg, Pieris, Andreas, Sallinger, Emanuel
Vadalog is a system for performing complex reasoning tasks such as those required in advanced knowledge graphs. The logical core of the underlying Vadalog language is the warded fragment of tuple-generating dependencies (TGDs). This formalism ensures tractable reasoning in data complexity, while a recent analysis focusing on a practical implementation led to the reasoning algorithm around which the Vadalog system is built. A fundamental question that has emerged in the context of Vadalog is the following: can we limit the recursion allowed by wardedness in order to obtain a formalism that provides a convenient syntax for expressing useful recursive statements, and at the same time achieves space-efficiency? After analyzing several real-life examples of warded sets of TGDs provided by our industrial partners, as well as recent benchmarks, we observed that recursion is often used in a restricted way: the body of a TGD contains at most one atom whose predicate is mutually recursive with a predicate in the head. We show that this type of recursion, known as piece-wise linear in the Datalog literature, is the answer to our main question. We further show that piece-wise linear recursion alone, without the wardedness condition, is not enough as it leads to the undecidability of reasoning. We finally study the relative expressiveness of the query languages based on (piece-wise linear) warded sets of TGDs.
An Ontology-Based Artificial Intelligence Model for Medicine Side-Effect Prediction: Taking Traditional Chinese Medicine as An Example
Wang, Zeheng, Lu, Kun, Cao, Jun, Yao, Yuanzhe, Li, Liang, Liu, Runyu, Liu, Zhiyuan, Yan, Jing
Artificial intelligence is a modern technology that is utilized in various fields of medicine [1-3]. At the meantime, Chinese Traditional Medicine (TCM) is now widely considered as a promising alternative medicine for complementary treatment in cancers or chronic diseases due to the effective methodology practically developed by generations of doctors for almost 4000 years [4]. Based on previous verification, it is undeniable that there are many correlations between the TCM syndromes and western diseases, turning out novel approaches for enhancing the treatment efficiency and developing medicines regarding with TCM methodologies [5]. Unfortunately, hindered by the remarkable gap between the modern informatics and the fundament of TCM: antient Chinese philosophy, such correlations are still too elusive to be formulated precisely. Therefore, recently, in order to figure out the deep connection between modern science and TCM, the research combining TCM with AI for valid knowledge acquisition and mining attracts extremely attention, and hereby, leading to many profound works, such as ontology information system design [6], latent tree models design [7], TCM warehouse for AI application [8], and digital knowledge graph development [2]. On the other hand, researchers face, however, many difficulties in setting up AI for TCM in terms of directly interpreting TCM semantic system (almost recorded by ancient Chinese doctrines) into structured database. Because in this way, considerable workload must be undertaken by limited numbers of experts who are proficient in both AI and TCM to translate the TCM terminologies and then formulate the modern model thereof. In contrast, as shown in Figure 1, the digestion of using TCM methodology in dealing with issues of modern science, new medicine design for example, is relatively lacking and thus of significant worth to explore.
Evidence-based lean logic profiles for conceptual data modelling languages
Fillottrani, Pablo Rubén, Keet, C. Maria
Multiple logic-based reconstruction of conceptual data modelling languages such as EER, UML Class Diagrams, and ORM exists. They mainly cover various fragments of the languages and none are formalised such that the logic applies simultaneously for all three modelling language families as unifying mechanism. This hampers interchangeability, interoperability, and tooling support. In addition, due to the lack of a systematic design process of the logic used for the formalisation, hidden choices permeate the formalisations that have rendered them incompatible. We aim to address these problems, first, by structuring the logic design process in a methodological way. We generalise and extend the DSL design process to apply to logic language design more generally and, in particular, by incorporating an ontological analysis of language features in the process. Second, availing of this extended process, of evidence gathered of language feature usage, and of computational complexity insights from Description Logics (DL), we specify logic profiles taking into account the ontological commitments embedded in the languages. The profiles characterise the minimum logic structure needed to handle the semantics of conceptual models, enabling the development of interoperability tools. There is no known DL language that matches exactly the features of those profiles and the common core is small (in the tractable $\mathcal{ALNI}$). Although hardly any inconsistencies can be derived with the profiles, it is promising for scalable runtime use of conceptual data models.
On2Vec: Embedding-based Relation Prediction for Ontology Population
Chen, Muhao, Tian, Yingtao, Chen, Xuelu, Xue, Zijun, Zaniolo, Carlo
Populating ontology graphs represents a long-standing problem for the Semantic Web community. Recent advances in translation-based graph embedding methods for populating instance-level knowledge graphs lead to promising new approaching for the ontology population problem. However, unlike instance-level graphs, the majority of relation facts in ontology graphs come with comprehensive semantic relations, which often include the properties of transitivity and symmetry, as well as hierarchical relations. These comprehensive relations are often too complex for existing graph embedding methods, and direct application of such methods is not feasible. Hence, we propose On2Vec, a novel translation-based graph embedding method for ontology population. On2Vec integrates two model components that effectively characterize comprehensive relation facts in ontology graphs. The first is the Component-specific Model that encodes concepts and relations into low-dimensional embedding spaces without a loss of relational properties; the second is the Hierarchy Model that performs focused learning of hierarchical relation facts. Experiments on several well-known ontology graphs demonstrate the promising capabilities of On2Vec in predicting and verifying new relation facts. These promising results also make possible significant improvements in related methods.