Ontologies
Who will speak at Data Day Texas 2020
Take advantage of our discount rooms at the conference hotel. We are beginning to announce speakers for 2020. Want to join us as a speaker? Check out our proposals page. Jesse Anderson is a data engineer, creative engineer, and managing director of the Big Data Institute. He works with companies ranging from startups to Fortune 100 companies on Big Data. This includes training on cutting edge technologies like Apache Kafka, Apache Hadoop and Apache Spark. He has taught over 30,000 people the skills to become data engineers.
The Data Fabric for Machine Learning – Part 2: Building a Knowledge-Graph - KDnuggets
How the new advances in semantics can help us be better at Machine Learning. Deep learning on graphs is taking more importance by the day. I've been talking about the data fabric in general, and giving some concepts of Machine Learning and Deep Learning in the data fabric. The Data Fabric is the platform that supports all the data in the company. How it's managed, described, combined and universally accessed.
Knowledge-based Biomedical Data Science 2019
Callahan, Tiffany J., Pielke-Lombardo, Harrison, Tripodi, Ignacio J., Hunter, Lawrence E.
Knowledge-based biomedical data science (KBDS) involves the design and implementation of computer systems that act as if they knew about biomedicine. Such systems depend on formally represented knowledge in computer systems, often in the form of knowledge graphs. Here we survey the progress in the last year in systems that use formally represented knowledge to address data science problems in both clinical and biological domains, as well as on approaches for creating knowledge graphs. Major themes include the relationships between knowledge graphs and machine learning, the use of natural language processing, and the expansion of knowledge-based approaches to novel domains, such as Chinese Traditional Medicine and biodiversity.
An achievable view of artificial intelligence
Artificial intelligence (AI) has been just over the horizon for decades now. From the cautionary tale of AI run amok in Stanley Kubrick's 2001: A Space Odyssey, to the benign computerized assistant that helped Captain Kirk "boldly go where no man had gone before" in Star Trek, the 1960s were filled with visions of an AI-enhanced future that still hasn't materialized a half-century later. But today we are assured that, despite the slow progress of the early years, we are truly on the cusp of realizing the vision of practical AI. It seems that every product that includes software advertises itself as leveraging the power of AI. With so much hype, a sober consideration of reality is in order.
Method for the semantic indexing of concept hierarchies, uniform representation, use of relational database systems and generic and case-based reasoning
Petersohn, Uwe, Zimmer, Sandra, Lehmann, Jens
This paper presents a method for semantic indexing and describes its application in the field of knowledge representation. Starting point of the semantic indexing is the knowledge represented by concept hierarchies. The goal is to assign keys to nodes (concepts) that are hierarchically ordered and syntactically and semantically correct. With the indexing algorithm, keys are computed such that concepts are partially unifiable with all more specific concepts and only semantically correct concepts are allowed to be added. The keys represent terminological relationships. Correctness and completeness of the underlying indexing algorithm are proven. The use of classical relational databases for the storage of instances is described. Because of the uniform representation, inference can be done using case-based reasoning and generic problem solving methods.
Distance-Based Approaches to Repair Semantics in Ontology-based Data Access
Prouté, César, Yun, Bruno, Croitoru, Madalina
In the presence of inconsistencies, repair techniques thrive to restore consistency by reasoning with several repairs. However, since the number of repairs can be large, standard inconsistent tolerant semantics usually yield few answers. In this paper, we use the notion of syntactic distance between repairs following the intuition that it can allow us to cluster some repairs "close" to each other. In this way, we propose a generic framework to answer queries in a more personalise fashion.
Deep Multiple Instance Learning for Taxonomic Classification of Metagenomic read sets
Georgiou, Andreas, Fortuin, Vincent, Mustafa, Harun, Rätsch, Gunnar
Metagenomic studies have increasingly utilized sequencing technologies in order to analyze DNA fragments found in environmental samples. It can provide useful insights for studying the interactions between hosts and microbes, infectious disease proliferation, and novel species discovery. One important step in this analysis is the taxonomic classification of those DNA fragments. Of particular interest is the determination of the distribution of the taxa of microbes in metagenomic samples. Recent attempts using deep learning focus on architectures that classify single DNA reads independently from each other. In this work, we attempt to solve the task of directly predicting the distribution over the taxa of whole metagenomic read sets. We formulate this task as a Multiple Instance Learning (MIL) problem. We extend architectures used in single-read taxonomic classification with two different types of permutation-invariant MIL pooling layers: a) deepsets and b) attention-based pooling. We illustrate that our architecture can exploit the co-occurrence of species in metagenomic read sets and outperforms the single-read architectures in predicting the distribution over the taxa at higher taxonomic ranks.
Artificial Intelligence BlockCloud (AIBC) Technical Whitepaper
The AIBC is an Artificial Intelligence and blockchain technology based large-scale decentralized ecosystem that allows system-wide low-cost sharing of computing and storage resources. The AIBC consists of four layers: a fundamental layer, a resource layer, an application layer, and an ecosystem layer. The AIBC implements a two-consensus scheme to enforce upper-layer economic policies and achieve fundamental layer performance and robustness: the DPoEV incentive consensus on the application and resource layers, and the DABFT distributed consensus on the fundamental layer. The DABFT uses deep learning techniques to predict and select the most suitable BFT algorithm in order to achieve the best balance of performance, robustness, and security. The DPoEV uses the knowledge map algorithm to accurately assess the economic value of digital assets.
Data Interpretation Support in Rescue Operations: Application for French Firefighters
Chehade, Samer, Matta, Nada, Pothin, Jean-Baptiste, Cogranne, Rémi
--This work aims at developing a system that supports French firefighters in data interpretation during rescue operations. An application ontology is proposed based on existing crisis management ones and operational expertise collection. After that, a knowledge-based system will be developed and integrated in firefighters' environment. Our first studies are shown in this paper. Rescue of people consists in saving their life in case of distress situations by applying responsive operations. In France, it is defined as specific tasks to be accomplished by public services in order to ensure the safety of patients and victims by making them able to escape from dangers, securing intervention sites, providing medical help, and finally, ensuring the evacuation to an appropriate place of reception [1].