Ontologies
ARGO - the Amazon Rainforest Genome Ontology project.
ARGO is short for the Amazon Rainforest Genome Ontology project. The mission is to tap into the scientific, biotechnological, and medical potential of the plant biodiversity in the Amazon rainforest in order to discover, utilise and preserve their biological value … before it is too late. If you want to follow our progress, please signup below.
Unsupervised Hierarchy Matching with Optimal Transport over Hyperbolic Spaces
Alvarez-Melis, David, Mroueh, Youssef, Jaakkola, Tommi S.
This paper focuses on the problem of unsupervised alignment of hierarchical data such as ontologies or lexical databases. This is a problem that appears across areas, from natural language processing to bioinformatics, and is typically solved by appeal to outside knowledge bases and label-textual similarity. In contrast, we approach the problem from a purely geometric perspective: given only a vector-space representation of the items in the two hierarchies, we seek to infer correspondences across them. Our work derives from and interweaves hyperbolic-space representations for hierarchical data, on one hand, and unsupervised word-alignment methods, on the other. We first provide a set of negative results showing how and why Euclidean methods fail in this hyperbolic setting. We then propose a novel approach based on optimal transport over hyperbolic spaces, and show that it outperforms standard embedding alignment techniques in various experiments on cross-lingual WordNet alignment and ontology matching tasks.
OntoScene, A Logic-based Scene Interpreter: Implementation and Application in the Rock Art Domain
Briola, Daniela, Mascardi, Viviana, Gioseffi, Massimiliano
OntoScene exploits ontologies for representing knowledge and Prolog for specifying the interpretation rules that domain experts may adopt, and for implementing the SceneInterpreter engine. Ontologies allow the designer to formalize the domain in a reusable way, and make the system modular and interoperable with existing multiagent systems, while Prolog provides a solid basis to define complex rules of interpretation in a way that can be affordable even for people with no background in Computational Logics. The domain selected for experimenting OntoScene is that of prehistoric rock art, which provides us with a fascinating and challenging testbed. Under consideration in Theory and Practice of Logic Programming (TPLP) KEYWORDS: Prolog; Ontologies; Multiagent Systems; Visual Languages; Scene Interpretation1 Introduction Human perception of complex visual scenes has been studied for a long time in psychology and neuroscience (Kondo et al. 2017): according to the seminal work on "high-level scene perception" (Henderson and Hollingworth 1999), besides low-level or early vision, concerned with extraction of physical properties such as depth, color, and texture from an image (Marr 1982), and intermediate-level vision, concerned with extraction of shape and spatial relations that can be determined without regard to meaning (Ullman 1996), a further level of vision is required to perceive and understand a scene: high-level vision concerns the mapping from visual representations to meaning and includes [...] the identification of objects and scenes. In their recent studies, Kveraga, Bar, and Baldassano (Kveraga and Bar 2014; Baldassano 2015) demonstrate that the brain has regions related to higher-order properties like overall geometry, arXiv:1911.04863v1
Overview of AI Libraries in Java Baeldung
Since this article is about libraries, we'll not make any introduction to AI itself. Additionally, theoretical background of AI is necessary in order to use libraries presented in this article. AI is a very wide field, so we will be focusing on the most popular fields today like Natural Language Processing, Machine Learning, Neural Networks and more. In the end, we'll mention few interesting AI challenges where you can practice your understanding of AI. Apache Jena is an open source Java framework for building semantic web and linked data applications from RDF data. The official website provides a detailed tutorial on how to use this framework with a quick introduction to RDF specification.
SHACL Constraints with Inference Rules
Pareti, Paolo, Konstantinidis, George, Norman, Timothy J., Şensoy, Murat
The Shapes Constraint Language (SHACL) has been recently introduced as a W3C recommendation to define constraints that can be validated against RDF graphs. Interactions of SHACL with other Semantic Web technologies, such as ontologies or reasoners, is a matter of ongoing research. In this paper we study the interaction of a subset of SHACL with inference rules expressed in datalog. On the one hand, SHACL constraints can be used to define a "schema" for graph datasets. On the other hand, inference rules can lead to the discovery of new facts that do not match the original schema. Given a set of SHACL constraints and a set of datalog rules, we present a method to detect which constraints could be violated by the application of the inference rules on some graph instance of the schema, and update the original schema, i.e, the set of SHACL constraints, in order to capture the new facts that can be inferred. We provide theoretical and experimental results of the various components of our approach.
A Heuristically Modified FP-Tree for Ontology Learning with Applications in Education
Shatnawi, Safwan, Gaber, Mohamed Medhat, Cocea, Mihaela
We propose a heuristically modified FP-Tree for ontology learning from text. Unlike previous research, for concept extraction, we use a regular expression parser approach widely adopted in compiler construction, i.e., deterministic finite automata (DFA). Thus, the concepts are extracted from unstructured documents. For ontology learning, we use a frequent pattern mining approach and employ a rule mining heuristic function to enhance its quality. This process does not rely on predefined lexico-syntactic patterns, thus, it is applicable for different subjects. We employ the ontology in a question-answering system for students' content-related questions. For validation, we used textbook questions/answers and questions from online course forums. Subject experts rated the quality of the system's answers on a subset of questions and their ratings were used to identify the most appropriate automatic semantic text similarity metric to use as a validation metric for all answers. The Latent Semantic Analysis was identified as the closest to the experts' ratings. We compared the use of our ontology with the use of Text2Onto for the question-answering system and found that with our ontology 80% of the questions were answered, while with Text2Onto only 28.4% were answered, thanks to the finer grained hierarchy our approach is able to produce.
Is it a Fruit, an Apple or a Granny Smith? Predicting the Basic Level in a Concept Hierarchy
Hollink, Laura, Bilgin, Aysenur, van Ossenbruggen, Jacco
The "basic level", according to experiments in cognitive psychology, is the level of abstraction in a hierarchy of concepts at which humans perform tasks quicker and with greater accuracy than at other levels. We argue that applications that use concept hierarchies - such as knowledge graphs, ontologies or taxonomies - could significantly improve their user interfaces if they `knew' which concepts are the basic level concepts. This paper examines to what extent the basic level can be learned from data. We test the utility of three types of concept features, that were inspired by the basic level theory: lexical features, structural features and frequency features. We evaluate our approach on WordNet, and create a training set of manually labelled examples that includes concepts from different domains. Our findings include that the basic level concepts can be accurately identified within one domain. Concepts that are difficult to label for humans are also harder to classify automatically. Our experiments provide insight into how classification performance across domains could be improved, which is necessary for identification of basic level concepts on a larger scale.
Knowledge Map: Toward a New Approach Supporting the Knowledge Management in Distributed Data Mining
Le-Khac, Nhien-An, Aouad, Lamine M., Kechadi, M-Tahar
Distributed data mining (DDM) deals with the problem of finding patterns or models, called knowledge, in an environment with distributed data and computations. Today, a massive amounts of data which are often geographically distributed and owned by different organisation are being mined. As consequence, a large mount of knowledge are being produced. This causes problems of not only knowledge management but also visualization in data mining. Besides, the main aim of DDM is to exploit fully the benefit of distributed data analysis while minimising the communication. Existing DDM techniques perform partial analysis of local data at individual sites and then generate a global model by aggregating these local results. These two steps are not independent since naive approaches to local analysis may produce an incorrect and ambiguous global data model. The integrating and cooperating of these two steps need an effective knowledge management, concretely an efficient map of knowledge in order to take the advantage of mined knowledge to guide mining the data. In this paper, we present "knowledge map", a representation of knowledge about mined knowledge. This new approach aims to manage efficiently mined knowledge in large scale distributed platform such as Grid. This knowledge map is used to facilitate not only the visualization, evaluation of mining results but also the coordinating of local mining process and existing knowledge to increase the accuracy of final model.