Goto

Collaborating Authors

 Overview


ASlib: A Benchmark Library for Algorithm Selection

arXiv.org Artificial Intelligence

The task of algorithm selection involves choosing an algorithm from a set of algorithms on a per-instance basis in order to exploit the varying performance of algorithms over a set of instances. The algorithm selection problem is attracting increasing attention from researchers and practitioners in AI. Years of fruitful applications in a number of domains have resulted in a large amount of data, but the community lacks a standard format or repository for this data. This situation makes it difficult to share and compare different approaches effectively, as is done in other, more established fields. It also unnecessarily hinders new researchers who want to work in this area. To address this problem, we introduce a standardized format for representing algorithm selection scenarios and a repository that contains a growing number of data sets from the literature. Our format has been designed to be able to express a wide variety of different scenarios. Demonstrating the breadth and power of our platform, we describe a set of example experiments that build and evaluate algorithm selection models through a common interface. The results display the potential of algorithm selection to achieve significant performance improvements across a broad range of problems and algorithms.


Machine Learning in Bioinformatics and Biomedical Engineering

#artificialintelligence

Machine learning is an artificial intelligence branch that has been well applied and recognized as an effective tool to handle a wide range of real situations. In the last few years, we have witnessed the explosion of Big Data, which has enables researchers to store data for analysis in an unprecedented way. This explosion in data available for analysis is as evident in healthcare as anywhere else. In particular, this special issue is focused on the areas of bioinformatics and biomedical engineering. These are two of the fastest developing research fields in the last few decades, since the biological data used to provide information is rapidly generated, and it is mandatory to be able to extract information and knowledge from them, as technological innovation in these fields is to be probably one of the most important developments in the next coming years.


[Video] Meet the Vietnamese Engineer Developing Google's Artificial Intelligence Saigoneer

#artificialintelligence

Next time you ask Google for directions or run an image search, thank Le Viet Quoc. The 34-year-old Vietnamese engineer is part of the team behind Google Brain, an artificial intelligence (AI) research project whose technology is responsible for such features, reports VnExpress. Part of Google's not-so-secret research outfit X, which pioneers cutting-edge technology like self-driving cars and delivery drones, Quoc works in a field known as "deep learning" which uses the human brain as a model to create "neural networks" for computers. Though deep learning's development has been slow, engineers like Quoc are making progress: in 2012, Google Brain made headlines when its network of 16,000 computer processors successfully learned how to search for cat videos on YouTube, despite being given no information prior to the test on how to identify such animals. The Stanford grad, who holds a doctorate in computer science and was named one of MIT's Innovators Under 35, is still working toward the creation of better, more intelligent machines.


Quadratization and Roof Duality of Markov Logic Networks

Journal of Artificial Intelligence Research

This article discusses the quadratization of Markov Logic Networks, which enables efficient approximate MAP computation by means of maximum flows. The procedure relies on a pseudo-Boolean representation of the model, and allows handling models of any order. The employed pseudo-Boolean representation can be used to identify problems that are guaranteed to be solvable in low polynomial-time. Results on common benchmark problems show that the proposed approach finds optimal assignments for most variables in excellent computational time and approximate solutions that match the quality of ILP-based solvers.


Generalized system identification with stable spline kernels

arXiv.org Machine Learning

Regularized least-squares approaches have been successfully applied to linear system identification. Recent approaches use quadratic penalty terms on the unknown impulse response defined by stable spline kernels, which control model space complexity by leveraging regularity and bounded-input bounded-output stability. This paper extends linear system identification to a wide class of nonsmooth stable spline estimators, where regularization functionals and data misfits can be selected from a rich set of piecewise linear quadratic penalties. This class encompasses the 1-norm, huber, and vapnik, in addition to the least-squares penalty, and the approach allows linear inequality constraints on the unknown impulse response. We develop a customized interior point solver for the entire class of proposed formulations. By representing penalties through their conjugates, we allow a simple interface that enables the user to specify any piecewise linear quadratic penalty for misfit and regularizer, together with inequality constraints on the response. The solver is locally quadratically convergent, with O(n2(m+n)) arithmetic operations per iteration, for n impulse response coefficients and m output measurements. In the system identification context, where n << m, IPsolve is competitive with available alternatives, illustrated by a comparison with TFOCS and libSVM. The modeling framework is illustrated with a range of numerical experiments, featuring robust formulations for contaminated data, relaxation systems, and nonnegativity and unimodality constraints on the impulse response. Incorporating constraints yields significant improvements in system identification. The solver used to obtain the results is distributed via an open source code repository.


EmTech India 2016: The digital future

#artificialintelligence

Global technology leaders and senior executives from around the world spoke on a range of topics, including Digital India, Smart Cities, Make in India, Skill India and cutting-edge technologies such as artificial intelligence, machine learning, 3D printing, drones, robotics, robotic surgeries and genomics, at the two-day EmTech India 2016 event, held in New Delhi on 18 and 19 March. The event was organized by Mint and MIT Technology Review, published by the Massachusetts Institute of Technology (MIT). The speakers included Jack Hidary, senior adviser at Google X Labs; Bhaskar Pramanik, chairman of Microsoft India; and Sharad Sharma, co-founder of think tank iSPIRT. The full list can be accessed at emtech.livemint.com/speakers. Here are edited excerpts from their speeches. A moonshot is an initiative that accompanies a goal that was previously thought to be near impossible. Moonshot philosophy sounds like it is quite radical and risky, but actually it is low-risk. That is because it attracts the best human capital and finance. Moonshot approaches do a few things. First, they attract the best human capital, which is a key driver of growth. They attract the best financial capital as well; capital from big and long-term thinkers. One describes India as a moonshot nation. India itself is going through a radical transformation--the likes of what we have never seen. This is very different to what is happening in China or any other country in the world. It is a combination of smartphones, digital payments, broadband and power of energy storage coming together. Smartphones ease the access to the Internet and open up users to mobile apps and that really changes the name of the game.


Deep Learning in a Nutshell: Core Concepts

#artificialintelligence

This post is the first in a series I'll be writing for Parallel Forall that aims to provide an intuitive and gentle introduction to deep learning. It covers the most important deep learning concepts and aims to provide an understanding of each concept rather than its mathematical and theoretical details. While the mathematical terminology is sometimes necessary and can further understanding, these posts use analogies and images whenever possible to provide easily digestible bits comprising an intuitive overview of the field of deep learning. I wrote this series in a glossary style so it can also be used as a reference for deep learning concepts. Part 1 focuses on introducing the main concepts of deep learning. Part 2 provides historical background and delves into the training procedures, algorithms and practical tricks that are used in training for deep learning. Part 3 covers sequence learning, including recurrent neural networks, LSTMs, and encoder-decoder systems for neural machine translation.


Valuing the Artificial Intelligence Market, Graphs and Predictions for 2016 and Beyond TechEmergence.com

#artificialintelligence

Wall Street, venture capitalists, technology executives – all have important reasons to understand the growth and opportunity of artificial intelligence, but the inherent vagueness of the term makes any single valuation extremely difficult. Indeed, the term "artificial intelligence" is notorious for having a relatively amorphous definition, itself. In order to put together an executive brief for market size and projected growth of AI, I've molded this article around (a) AI-related industry market research forecasts, and (b) a limited number of reputable research sources for further insight into AI valuation and forecasting, in addition to select and relevant quotes. Bear in mind that different market research firms define "artificial intelligence." To make this summary article more useful, we've quickly broken down all reports by source, definition / meaning of "artificial intelligence", valuation, and timeline.


Implementing your own k-nearest neighbour algorithm using Python

#artificialintelligence

In machine learning, you may often wish to build predictors that allows to classify things into categories based on some set of associated values. For example, it is possible to provide a diagnosis to a patient based on data from previous patients. Many algorithms have been developed for automated classification, and common ones include random forests, support vector machines, Naïve Bayes classifiers, and many types of neural networks. To get a feel for how classification works, we take a simple example of a classification algorithm – k-Nearest Neighbours (kNN) – and build it from scratch in Python 2. You can use a mostly imperative style of coding, rather than a declarative/functional one with lambda functions and list comprehensions to keep things simple if you are starting with Python. Here, we will provide an introduction to the latter approach.


Bay Area NLP (Natural Language Processing)

@machinelearnbot

Stanford CoreNLP is an extensible, open source, JVM-based NLP toolkit with good quality core natural language analysis components, quite widely used in academia, companies, and government. This talk will give an overview, look at use from the command-line, code, and the web API, including the new server and new annotators, and provide a deeper dive going through a pipeline we recently built for a machine reading task. We'd also welcome questions (and requests!) from people who have used CoreNLP. We'll open at 7 and the talk will begin at 7:30, co-presented by Christopher Manning, Professor at Stanford University and Jason Bolton, Research Engineer at Stanford University.