Ontologies
Parsa Mirhaji Montefiore Health System - PMWC Precision Medicine World Conference
Dr. Mirhaji was the former director of the Center for Biosecurity and Public Health Informatics Research at the University of Texas at Houston where he developed clinical text understanding, semantic information integration, and EMR interoperability solutions, for public health and disaster preparedness. He is an inventor with several patents covering information integration, biomedical vocabularies and taxonomy services, clinical text understanding and natural language processing, electronic data capture, and knowledge-based information retrieval. Dr. Mirhaji and his fellow researchers were awarded "The Best Practice in Public Health. He is a member of W3C working groups for application of Semantic Technologies in Healthcare and Life Sciences, and organizer and committee member for several national and international conferences on Bio-Ontologies and Semantic Technologies.
Webinar summary - Semantic annotation of images in the FAIR data era CGIAR Platform for Big Data in Agriculture
Digital agriculture increasingly relies on the generation of large quantity of images. These images are processed with machine learning techniques to speed up the identification of objects, their classification, visualization, and interpretation. However, images must comply with the FAIR principles to facilitate their access, reuse, and interoperability. As stated in recent paper authored by the Planteome team (Trigkakis et al, 2018), "Plant researchers could benefit greatly from a trained classification model that predicts image annotations with a high degree of accuracy." In this third Ontologies Community of Practice webinar, Justin Preece, Senior Faculty Research Assistant Oregon State University, presents the module developed by the Planteome project using the Bio-Image Semantic Query User Environment (BISQUE), an online image analysis and storage platform of Cyverse.
The future of Pharma: harnessing AI to decentralise data
As Chief Data Officer for the OSTHUS Group, Eric Little co-founded LeapAnalysis, a new approach to AI, data integration and analytics. LeapAnalysis is the first fully federated and virtualised search and analytics engine that runs on semantic metadata. It allows users to combine semantic models (ontologies) with machine learning algorithms to provide customers with unparalleled flexibility in utilizing their data. Nearly all technologies surrounding AI and analytics are purely statistical in nature, using algorithmic approaches that are not incredibly novel, such as decision trees, neural networks, etc. The logical framework that contextualises these things is often missing.
Enabling Semantic Data Access for Toxicological Risk Assessment
Myklebust, Erik Bryhn, Jimenez-Ruiz, Ernesto, Chen, Jiaoyan, Wolf, Raoul, Tollefsen, Knut Erik
Experimental effort and animal welfare are concerns when exploring the effects a compound has on an organism. Appropriate methods for extrapolating chemical effects can further mitigate these challenges. In this paper we present the efforts to (i) (pre)process and gather data from public and private sources, varying from tabular files to SPARQL endpoints, (ii) integrate the data and represent them as a knowledge graph with richer semantics. This knowledge graph is further applied to facilitate the retrieval of the relevant data for a ecological risk assessment task, extrapolation of effect data, where two prediction techniques are developed.
General Fragment Model for Information Artifacts
Fiorini, Sandro Rama, Santos, Wallas Sousa dos, Mesquita, Rodrigo Costa, Lima, Guilherme Ferreira, Moreno, Marcio F.
The use of semantic descriptions in data intensive domains require a systematic model for linking semantic descriptions with their manifestations in fragments of heterogeneous information and data objects. Such information heterogeneity requires a fragment model that is general enough to support the specification of anchors from conceptual models to multiple types of information artifacts. While diverse proposals of anchoring models exist in the literature, they are usually focused in audiovisual information. We propose a generalized fragment model that can be instantiated to different kinds of information artifacts. Our objective is to systematize the way in which fragments and anchors can be described in conceptual models, without committing to a specific vocabulary.
Commonsense Reasoning Using WordNet and SUMO: a Detailed Analysis
Álvez, Javier, Gonzalez-Dios, Itziar, Rigau, German
We describe a detailed analysis of a sample of large benchmark of commonsense reasoning problems that has been automatically obtained from WordNet, SUMO and their mapping. The objective is to provide a better assessment of the quality of both the benchmark and the involved knowledge resources for advanced commonsense reasoning tasks. By means of this analysis, we are able to detect some knowledge misalignments, mapping errors and lack of knowledge and resources. Our final objective is the extraction of some guidelines towards a better exploitation of this commonsense knowledge framework by the improvement of the included resources.
SQuAP-Ont: an Ontology of Software Quality Relational Factors from Financial Systems
Ciancarini, Paolo, Nuzzolese, Andrea Giovanni, Presutti, Valentina, Russo, Daniel
Quality, architecture, and process are considered the keystones of software engineering. ISO defines them in three separate standards. However, their interaction has been scarcely studied, so far. The SQuAP model (Software Quality, Architecture, Process) describes twenty-eight main factors that impact on software quality in banking systems, and each factor is described as a relation among some characteristics from the three ISO standards. Hence, SQuAP makes such relations emerge rigorously, although informally. In this paper, we present SQuAP-Ont, an OWL ontology designed by following a well-established methodology based on the reuse of Ontology Design Patterns (i.e. SQuAP-Ont formalises the relations emerging from SQuAP to represent and reason via Linked Data about software engineering in a three-dimensional model consisting of quality, architecture, and process ISO characteristics. Industrial standards are widely used in the software engineering practice: they are built on preexisting literature and provide a common ground to scholars and practitioners to analyze, develop, and assess software systems. As far as software quality is concerned, the reference standard is the ISO/IEC 25010:2011 (ISO quality from now on), which defines the quality of software products and their usage (i.e., in-use quality). The ISO quality standard introduces eight characteristics that qualify a software product, and five characteristics that assess its quality in use. A characteristic is a parameter for measuring the quality of a software system-related aspect, e.g., reliability, usability, performance efficiency. The quantitative value associated with a characteristic is measured employing metrics that are dependent on the context of a specific software project and defined following the established literature.
From Textual Information Sources to Linked Data in the Agatha Project
Quaresma, Paulo, Nogueira, Vitor Beires, Raiyani, Kashyap, Bayot, Roy, Gonçalves, Teresa
Automatic reasoning about textual information is a challenging task in modern Natural Language Processing (NLP) systems. In this work we describe our proposal for representing and reasoning about Portuguese documents by means of Linked Data like ontologies and thesauri. Our approach resorts to a specialized pipeline of natural language processing (part-of-speech tagger, named entity recognition, semantic role labeling) to populate an ontology for the domain of criminal investigations. The provided architecture and ontology are language independent. Although some of the NLP modules are language dependent, they can be built using adequate AI methodologies.
The Semantic Asset Administration Shell
Bader, Sebastian R., Maleshkova, Maria
The disruptive potential of the upcoming digital transformations for the industrial manufacturing domain have led to several reference frameworks and numerous standardization approaches. On the other hand, the Semantic Web community has made significant contributions in the field, for instance on data and service description, integration of heterogeneous sources and devices, and AI techniques in distributed systems. These two streams of work are, however, mostly unrelated and only briefly regard each others requirements, practices and terminology. We contribute to closing this gap by providing the Semantic Asset Administration Shell, an RDF-based representation of the Industrie 4.0 Component. We provide an ontology for the latest data model specification, created a RML mapping, supply resources to validate the RDF entities and introduce basic reasoning on the Asset Administration Shell data model. Furthermore, we discuss the different assumptions and presentation patterns, and analyze the implications of a semantic representation on the original data. We evaluate the thereby created overheads, and conclude that the semantic lifting is manageable, also for restricted or embedded devices, and therefore meets the needs of Industrie 4.0 scenarios.
Smart Buildings with IoT Knowledge Graphs at Schneider Electric
In April 2019 our partner Schneider Electric launched EcoStruxure Workplace Advisor, a smart building application aiming to increase the efficiency of managed office facilities. In this posting I want to outline the general architecture of this application which is based on Trinity RDF: our enterprise .NET API which enables developers without RDF experience to build knowledge graph applications. For anyone interested in increasing the productivity and flexibility of knowledge graph development teams I would like to advertise my talk on Tuesday where I will share more details about the case. The industry use case I will be presenting is Schneider Electric's EcoStruxure Workplace Advisor. Using this service one can derive actionable insights about a building through intuitive dashboards that analyse and integrate data from numerable IoT sensors and systems.