Goto

Collaborating Authors

 Overview


Transfer Learning - Machine Learning's Next Frontier

#artificialintelligence

In recent years, we have become increasingly good at training deep neural networks to learn a very accurate mapping from inputs to outputs, whether they are images, sentences, label predictions, etc. from large amounts of labeled data. What our models still frightfully lack is the ability to generalize to conditions that are different from the ones encountered during training. Every time you apply your model not to a carefully constructed dataset but to the real world. The real world is messy and contains an infinite number of novel scenarios, many of which your model has not encountered during training and for which it is in turn ill-prepared to make predictions. The ability to transfer knowledge to new conditions is generally known as transfer learning and is what we will discuss in the rest of this post. Over the course of this blog post, I will first contrast transfer learning with machine learning's most pervasive and successful paradigm, supervised learning. I will then outline reasons why transfer learning warrants our attention. Subsequently, I will give a more technical definition and detail different transfer learning scenarios. I will then provide examples of applications of transfer learning before delving into practical methods that can be used to transfer knowledge.


Deep Feature Synthesis: How Automated Feature Engineering Works

@machinelearnbot

The artificial intelligence market is fueled by the potential to use data to change the world. While many organizations have already successfully adapted to this paradigm, applying machine learning to new problems is still challenging. The single biggest technical hurdle that machine learning algorithms must overcome is their need for processed data in order to work -- they can only make predictions from numeric data. This data is composed of relevant variables, known as "features." If the calculated features don't clearly expose the predictive signals, no amount of tuning can take a model to the next level.


Graph Summarization Methods and Applications: A Survey

arXiv.org Artificial Intelligence

While advances in computing resources have made processing enormous amounts of data possible, human ability to identify patterns in such data has not scaled accordingly. Efficient computational methods for condensing and simplifying data are thus becoming vital for extracting actionable insights. In particular, while data summarization techniques have been studied extensively, only recently has summarizing interconnected data, or graphs, become popular. This survey is a structured, comprehensive overview of the state-of-the-art methods for summarizing graph data. We first broach the motivation behind, and the challenges of, graph summarization. We then categorize summarization approaches by the type of graphs taken as input and further organize each category by core methodology. Finally, we discuss applications of summarization on real-world graphs and conclude by describing some open problems in the field.


The variational Laplace approach to approximate Bayesian inference

arXiv.org Machine Learning

Variational approaches to approximate Bayesian inference provide very efficient means of performing parameter estimation and model selection. Among these, so-called variational-Laplace or VL schemes rely on Gaussian approximations to posterior densities on model parameters. In this note, we review the main variants of VL approaches, that follow from considering nonlinear models of continuous and/or categorical data. En passant, we also derive a few novel theoretical results that complete the portfolio of existing analyses of variational Bayesian approaches, including investigations of their asymptotic convergence. We also suggest practical ways of extending existing VL approaches to hierarchical generative models that include (e.g., precision) hyperparameters.


[R] Theoretical Impediments to Machine Learning With Seven Sparks from the Causal Revolution • r/MachineLearning

@machinelearnbot

Abstract: Current machine learning systems operate, almost exclusively, in a statistical, or model-free mode, which entails severe theoretical limits on their power and performance. Such systems cannot reason about interventions and retrospection and, therefore, cannot serve as the basis for strong AI. To achieve human level intelligence, learning machines need the guidance of a model of reality, similar to the ones used in causal inference tasks. To demonstrate the essential role of such models, I will present a summary of seven tasks which are beyond reach of current machine learning systems and which have been accomplished using the tools of causal modeling.


Behavior Trees in Robotics and AI: An Introduction

arXiv.org Artificial Intelligence

A Behavior Tree (BT) is a way to structure the switching between different tasks in an autonomous agent, such as a robot or a virtual entity in a computer game. BTs are a very efficient way of creating complex systems that are both modular and reactive. These properties are crucial in many applications, which has led to the spread of BT from computer game programming to many branches of AI and Robotics. In this book, we will first give an introduction to BTs, then we describe how BTs relate to, and in many cases generalize, earlier switching structures. These ideas are then used as a foundation for a set of efficient and easy to use design principles. Properties such as safety, robustness, and efficiency are important for an autonomous system, and we describe a set of tools for formally analyzing these using a state space description of BTs. With the new analysis tools, we can formalize the descriptions of how BTs generalize earlier approaches. We also show the use of BTs in automated planning and machine learning. Finally, we describe an extended set of tools to capture the behavior of Stochastic BTs, where the outcomes of actions are described by probabilities. These tools enable the computation of both success probabilities and time to completion.


Oncotarget Converging blockchain and next-generation artificial intelligence technologies to decentralize and accelerate biomedical research and healthcare

#artificialintelligence

The increased availability of data and recent advancements in artificial intelligence present the unprecedented opportunities in healthcare and major challenges for the patients, developers, providers and regulators. The novel deep learning and transfer learning techniques are turning any data about the person into medical data transforming simple facial pictures and videos into powerful sources of data for predictive analytics. Presently, the patients do not have control over the access privileges to their medical records and remain unaware of the true value of the data they have. In this paper, we provide an overview of the next-generation artificial intelligence and blockchain technologies and present innovative solutions that may be used to accelerate the biomedical research and enable patients with new tools to control and profit from their personal data as well with the incentives to undergo constant health monitoring. We introduce new concepts to appraise and evaluate personal records, including the combination-, time- and relationship-value of the data.


Top Stories, Jan 1-7: Docker for Data Science; Quantum Machine Learning: An Overview

@machinelearnbot

Docker for Data Science, by Sachin Abeywardana Top 10 Machine Learning Algorithms for Beginners, by Reena Shaw How Much Mathematics Does an IT Engineer Need to Learn to Get Into Data Science? How Much Mathematics Does an IT Engineer Need to Learn to Get Into Data Science? Top Stories, Dec 18-31: How Much Mathematics Does an IT Engineer Need to Learn to Get Into Data Science?; Computer Vision by Andrew Ng – 11 Lessons Learned - Jan 03, 2018. How to build a Successful Advanced Analytics Department - Jan 04, 2018. Top Stories, Dec 18-31: How Much Mathematics Does an IT Engineer Need to Learn to Get Into Data Science?; Computer Vision by Andrew Ng – 11 Lessons Learned - Jan 03, 2018.


A Review of 40 Years of Cognitive Architecture Research: Core Cognitive Abilities and Practical Applications

arXiv.org Artificial Intelligence

In this paper we present a broad overview of the last 40 years of research on cognitive architectures. Although the number of existing architectures is nearing several hundred, most of the existing surveys do not reflect this growth and focus on a handful of well-established architectures. Thus, in this survey we wanted to shift the focus towards a more inclusive and high-level overview of the research on cognitive architectures. Our final set of 84 architectures includes 49 that are still actively developed, and borrow from a diverse set of disciplines, spanning areas from psychoanalysis to neuroscience. To keep the length of this paper within reasonable limits we discuss only the core cognitive abilities, such as perception, attention mechanisms, action selection, memory, learning and reasoning. In order to assess the breadth of practical applications of cognitive architectures we gathered information on over 900 practical projects implemented using the cognitive architectures in our list. We use various visualization techniques to highlight overall trends in the development of the field. In addition to summarizing the current state-of-the-art in the cognitive architecture research, this survey describes a variety of methods and ideas that have been tried and their relative success in modeling human cognitive abilities, as well as which aspects of cognitive behavior need more research with respect to their mechanistic counterparts and thus can further inform how cognitive science might progress.


Data Scientists and Machine Learning Algorithms for the Data-Driven World - DATAVERSITY

@machinelearnbot

Artificial Intelligence (AI) and Machine Learning are projected to become mainstream technologies in the coming years, and are clearly already having a significant impact across many industries. How exactly is this happening? How are Data Scientists using their skills to develop better Machine Learning algorithms? Where are these innovative technologies going in the future? With the rise in the implementation and usage of once revolutionary technologies/trends like Big Data, the Internet of Things (IoT), or the Cloud, Machine Learning (ML) and now Deep Learning (DL) are gradually moving into mainstream business corridors.