Ontologies
RHOG: A Refinement-Operator Library for Directed Labeled Graphs
Intuitively, locally finiteness means that the refinement operator is computable, completeness means we can generate, by refinement of a, any element of G related to a given element g 1 by the order relation, and properness means that a refinement operator does not generate elements which are equivalent to the element being refined. When a refinement operator is locally finite, complete and proper, we say that it is ideal. Notice that all the subsumption relations presented above satisfy the reflexive 2 and transitive 3 properties. Therefore, the pair (G,), where G is the set of all DLGs given a set of labels L, and is any of the subsumption relations defined above is a quasi-ordered set. Thus, this opens the door to defining refinement operators for DLGs. Intuitively, a downward refinement operator for DLGs will generate refinements of a given DLG by either adding vertices, edges, or by making some of the labels more specific, thus making the graph more specific. In the following subsections, we will introduce a collection of refinement operators for connected DLGs, and discuss their theoretical properties. A summary of these operators is shown in Table 1, where we show that under the object-identity constraint, all the refinement operators presented in this document are ideal. If we do not impose object-identity, then the operators are locally complete and complete, but not proper.
Ontologies in CLARIAH: Towards Interoperability in History, Language and Media
Meroño-Peñuela, Albert, de Boer, Victor, van Erp, Marieke, Melder, Willem, Mourits, Rick, Rijpma, Auke, Schalk, Ruben, Zijdeman, Richard
One of the most important goals of digital humanities is to provide researchers with data and tools for new research questions, either by increasing the scale of scholarly studies, linking existing databases, or improving the accessibility of data. Here, the FAIR principles provide a useful framework as these state that data needs to be: Findable, as they are often scattered among various sources; Accessible, since some might be offline or behind paywalls; Interoperable, thus using standard knowledge representation formats and shared vocabularies; and Reusable, through adequate licensing and permissions. Integrating data from diverse humanities domains is not trivial, research questions such as "was economic wealth equally distributed in the 18th century?", or "what are narratives constructed around disruptive media events?") and preparation phases (e.g. data collection, knowledge organisation, cleaning) of scholars need to be taken into account. In this chapter, we describe the ontologies and tools developed and integrated in the Dutch national project CLARIAH to address these issues across datasets from three fundamental domains or "pillars" of the humanities (linguistics, social and economic history, and media studies) that have paradigmatic data representations (textual corpora, structured data, and multimedia). We summarise the lessons learnt from using such ontologies and tools in these domains from a generalisation and reusability perspective.
Graph Databases: The Story-tellers of the Database World
The big rub on the first generation of graph databases was that although RDF triple stores were great at storing the simple sentence, they had a hard time with the adverbs, adjectives and clarifying phrases of your data story. If I wanted to store'John is a carpenter since 2001' or'John from Alberta Canada is a carpenter liked by 702 people', the syntax of old-school triple stores had a more tedious, but not impossible way of handling it. It involved creating extra nodes that were confusing to some and a process called reification. Until about a year ago, labeled property graphs (LPG) were better at color and detail than RDF, having a more intuitive syntax for clarifying adverbs, adjectives, and phrases. That was, of course, until recently.
BIM4EEB ontologies framework - BIM4EEB
Interoperability in the construction sector is a key issue and researchers, developers and designers have tackled since the introduction of CAD systems. Traditionally, engineers, architects and site operators interact and track their information exchange through paper or digitalized drawings and e-mails. With the introduction of Building Information Modelling (BIM) techniques and tools, operators are using new solutions and methods to keep track and exploit these data. Cover image: ifcOWL ontology (version IFC4ADD2) visualized thanks to WebVOWL, available hereWhat has been described as traditional method corresponds to Level 0 in well-known BIM levels definition. The concept of BIM level 1 represents the criteria needed for the full collaboration for the industry.
Ontology-based Interpretable Machine Learning for Textual Data
Lai, Phung, Phan, NhatHai, Hu, Han, Badeti, Anuja, Newman, David, Dou, Dejing
In this paper, we introduce a novel interpreting framework that learns an interpretable model based on an ontology-based sampling technique to explain agnostic prediction models. Different from existing approaches, our algorithm considers contextual correlation among words, described in domain knowledge ontologies, to generate semantic explanations. To narrow down the search space for explanations, which is a major problem of long and complicated text data, we design a learnable anchor algorithm, to better extract explanations locally. A set of regulations is further introduced, regarding combining learned interpretable representations with anchors to generate comprehensible semantic explanations. An extensive experiment conducted on two real-world datasets shows that our approach generates more precise and insightful explanations compared with baseline approaches.
Enriching Consumer Health Vocabulary Using Enhanced GloVe Word Embedding
Ibrahim, Mohammed, Gauch, Susan, Salman, Omar, Alqahatani, Mohammed
Open-Access and Collaborative Consumer Health Vocabulary (OAC CHV, or CHV for short), is a collection of medical terms written in plain English. It provides a list of simple, easy, and clear terms that laymen prefer to use rather than an equivalent professional medical term. The National Library of Medicine (NLM) has integrated and mapped the CHV terms to their Unified Medical Language System (UMLS). These CHV terms mapped to 56000 professional concepts on the UMLS. We found that about 48% of these laymen's terms are still jargon and matched with the professional terms on the UMLS. In this paper, we present an enhanced word embedding technique that generates new CHV terms from a consumer-generated text. We downloaded our corpus from a healthcare social media and evaluated our new method based on iterative feedback to word embeddings using ground truth built from the existing CHV terms. Our feedback algorithm outperformed unmodified GLoVe and new CHV terms have been detected.
Best Practices for Implementing FAIR Vocabularies and Ontologies on the Web
Garijo, Daniel, Poveda-Villalón, María
With the adoption of Semantic Web technologies, an increasing number of vocabularies and ontologies have been developed in different domains, ranging from Biology to Agronomy or Geosciences. However, many of these ontologies are still difficult to find, access and understand by researchers due to a lack of documentation, URI resolving issues, versioning problems, etc. In this chapter we describe guidelines and best practices for creating accessible, understandable and reusable ontologies on the Web, using standard practices and pointing to existing tools and frameworks developed by the Semantic Web community. We illustrate our guidelines with concrete examples, in order to help researchers implement these practices in their future vocabularies.
Can AI Achieve Common Sense to Make Machines More Intelligent?
Today machines with artificial intelligence (AI) are becoming more prevalent in society. Across many fields, AI has taken over numerous tasks that humans used to do earlier. As the reference is to human intelligence, artificial intelligence is being modified into what humans can do. However, the technology has not yet matched the level of utmost wisdom possessed by humans and it seems like it is not going to achieve the milestone any time sooner. To replace human beings at most jobs, machines need to exhibit what we intuitively call "common sense".
Axiom Pinpointing
Axiom pinpointing refers to the task of finding the specific axioms in an ontology which are responsible for a consequence to follow. This task has been studied, under different names, in many research areas, leading to a reformulation and reinvention of techniques. In this work, we present a general overview to axiom pinpointing, providing the basic notions, different approaches for solving it, and some variations and applications which have been considered in the literature. This should serve as a starting point for researchers interested in related problems, with an ample bibliography for delving deeper into the details.