Overview
Deep Advances in Generative Modeling
In recent years, deep learning approaches have come to dominate discriminative problems in many sub-areas of machine learning. Alongside this, they have also powered exciting improvements in generative and conditional modeling of richly structured data such as text, images, and audio. This talk, led by indico's Head of Research, Alec Radford, will serve as an introduction to several emerging application areas of generative modeling and provide a survey of recent techniques in the field.
Machine Learning Prague 2016 – conference on machine learning in practice
With ever increasing data, Machine Learning is becoming the only way to get analytics done, making it possible to glean insights from vast amounts of data. But when starting a project, it is easy to ignore a critical fact: the value of data is also time sensitive – it expires! In order to get the highest value from data, Machine Learning needs to be applied in a rapid and repeatable way, so you can go from data to insight quickly. A Machine Learning API makes this possible. In this workshop, Poul Petersen CIO of BigML will give an overview of BigML's Machine Learning API and then show real-world examples of predictive applications that can be built using Python and node.js. Several tools that have been built on top of BigML's API will be demonstrated including a loan risk assessment, real estate arbitrage, and the world's first voice controlled predictive assistant.
SpeechTEK agenda for Monday, May 23, 2016
The field of intellectual property is rapidly evolving, both with respect to the law and the technologies being considered for protection. This session provides a primer about what a patent is, current best practices for protecting speech technologies and defending against assertion, and the recent evolution of intellectual property law in the United States, with emphasis on speech, software user interfaces, and mobile technologies. Fraudsters are using robodialing and ANI spoofing to wreak havoc on call centers. From the illegal practice of toll-free traffic pumping and international revenue-sharing fraud, to the more villainous acts of financial account fraud, identity theft, and drug trafficking, this seminar explores the unusual ways criminals are hacking our businesses. We also examine simple and cost-effective practices to protect our businesses, and our customers.
Patterns of Scalable Bayesian Inference
Angelino, Elaine, Johnson, Matthew James, Adams, Ryan P.
Datasets are growing not just in size but in complexity, creating a demand for rich models and quantification of uncertainty. Bayesian methods are an excellent fit for this demand, but scaling Bayesian inference is a challenge. In response to this challenge, there has been considerable recent work based on varying assumptions about model structure, underlying computational resources, and the importance of asymptotic correctness. As a result, there is a zoo of ideas with few clear overarching principles. In this paper, we seek to identify unifying principles, patterns, and intuitions for scaling Bayesian inference. We review existing work on utilizing modern computing resources with both MCMC and variational approximation techniques. From this taxonomy of ideas, we characterize the general principles that have proven successful for designing scalable inference procedures and comment on the path forward.
Global Brain That Makes You Think Twice
Rzepka, Rafal (Hokkaido University) | Mazur, Michal (Hokkaido University) | Clapp, Austin (Stanford University) | Araki, Kenji (Hokkaido University)
In this position paper we introduce our approach to positive computing by developing and integrating methods for future assistant and companion agents which could help us a) avoid making mistakes due to biases caused by insufficient knowledge, b) be more empathic and righteous, c) be more sensitive and thoughtful. We present text processing techniques for automatic discovery of possible reasoning errors and provide hints to make users doubt their beliefs when there is a possibility of harm. We present existing sources and methods, discuss on how natural language processing technologies could contribute to various aspects of well-being by giving examples of systems we develop, and describe the strengths and weaknesses of our approach.
Towards Interpretable Explanations for Transfer Learning in Sequential Tasks
Ramakrishnan, Ramya (Massachusetts Institute of Technology) | Shah, Julie ( Massachusetts Institute of Technology )
People increasingly rely on machine learning (ML) to make intelligent decisions. However, the ML results are often difficult to interpret and the algorithms do not support interaction to solicit clarification or explanation. In this paper, we highlight an emerging research area of interpretable explanations for transfer learning in sequential tasks, in which an agent must explain how it learns a new task given prior, common knowledge. The goal is to enhance a user's ability to trust and use the system output and to enable iterative feedback for improving the system. We review prior work in probabilistic systems, sequential decision-making, interpretable explanations, transfer learning, and interactive machine learning, and identify an intersection that deserves further research focus. We believe that developing adaptive, transparent learning models will build the foundation for better human-machine systems in applications for elder care, education, and health care.
A Review of Theoretical and Practical Challenges of Trusted Autonomy in Big Data
Abbass, Hussein A., Leu, George, Merrick, Kathryn
Despite the advances made in artificial intelligence, software agents, and robotics, there is little we see today that we can truly call a fully autonomous system. We conjecture that the main inhibitor for advancing autonomy is lack of trust. Trusted autonomy is the scientific and engineering field to establish the foundations and ground work for developing trusted autonomous systems (robotics and software agents) that can be used in our daily life, and can be integrated with humans seamlessly, naturally and efficiently. In this paper, we review this literature to reveal opportunities for researchers and practitioners to work on topics that can create a leap forward in advancing the field of trusted autonomy. We focus the paper on the `trust' component as the uniting technology between humans and machines. Our inquiry into this topic revolves around three sub-topics: (1) reviewing and positioning the trust modelling literature for the purpose of trusted autonomy; (2) reviewing a critical subset of sensor technologies that allow a machine to sense human states; and (3) distilling some critical questions for advancing the field of trusted autonomy. The inquiry is augmented with conceptual models that we propose along the way by recompiling and reshaping the literature into forms that enables trusted autonomous systems to become a reality. The paper offers a vision for a Trusted Cyborg Swarm, an extension of our previous Cognitive Cyber Symbiosis concept, whereby humans and machines meld together in a harmonious, seamless, and coordinated manner.
A Primer on the Signature Method in Machine Learning
Chevyrev, Ilya, Kormilitzin, Andrey
In these notes, we wish to provide an introduction to the signature method, focusing on its basic theoretical properties and recent numerical applications. The notes are split into two parts. The first part focuses on the definition and fundamental properties of the signature of a path, or the path signature. We have aimed for a minimalistic approach, assuming only familiarity with classical real analysis and integration theory, and supplementing theory with straightforward examples. We have chosen to focus in detail on the principle properties of the signature which we believe are fundamental to understanding its role in applications. We also present an informal discussion on some of its deeper properties and briefly mention the role of the signature in rough paths theory, which we hope could serve as a light introduction to rough paths for the interested reader. The second part of these notes discusses practical applications of the path signature to the area of machine learning. The signature approach represents a non-parametric way for extraction of characteristic features from data. The data are converted into a multi-dimensional path by means of various embedding algorithms and then processed for computation of individual terms of the signature which summarise certain information contained in the data. The signature thus transforms raw data into a set of features which are used in machine learning tasks. We will review current progress in applications of signatures to machine learning problems.
Metric Learning with Adaptive Density Discrimination
Rippel, Oren, Paluri, Manohar, Dollar, Piotr, Bourdev, Lubomir
Distance metric learning (DML) approaches learn a transformation to a representation space where distance is in correspondence with a predefined notion of similarity. While such models offer a number of compelling benefits, it has been difficult for these to compete with modern classification algorithms in performance and even in feature extraction. In this work, we propose a novel approach explicitly designed to address a number of subtle yet important issues which have stymied earlier DML algorithms. It maintains an explicit model of the distributions of the different classes in representation space. It then employs this knowledge to adaptively assess similarity, and achieve local discrimination by penalizing class distribution overlap. We demonstrate the effectiveness of this idea on several tasks. Our approach achieves state-of-the-art classification results on a number of fine-grained visual recognition datasets, surpassing the standard softmax classifier and outperforming triplet loss by a relative margin of 30-40%. In terms of computational performance, it alleviates training inefficiencies in the traditional triplet loss, reaching the same error in 5-30 times fewer iterations. Beyond classification, we further validate the saliency of the learnt representations via their attribute concentration and hierarchy recovery properties, achieving 10-25% relative gains on the softmax classifier and 25-50% on triplet loss in these tasks.
Optimally Solving Dec-POMDPs as Continuous-State MDPs
Dibangoye, Jilles Steeve, Amato, Christopher, Buffet, Olivier, Charpillet, François
Decentralized partially observable Markov decision processes (Dec-POMDPs) provide a general model for decision-making under uncertainty in decentralized settings, but are difficult to solve optimally (NEXP-Complete). As a new way of solving these problems, we introduce the idea of transforming a Dec-POMDP into a continuous-state deterministic MDP with a piecewise-linear and convex value function. This approach makes use of the fact that planning can be accomplished in a centralized offline manner, while execution can still be decentralized. This new Dec-POMDP formulation, which we call an occupancy MDP, allows powerful POMDP and continuous-state MDP methods to be used for the first time. To provide scalability, we refine this approach by combining heuristic search and compact representations that exploit the structure present in multi-agent domains, without losing the ability to converge to an optimal solution. In particular, we introduce a feature-based heuristic search value iteration (FB-HSVI) algorithm that relies on feature-based compact representations, point-based updates and efficient action selection. A theoretical analysis demonstrates that FB-HSVI terminates in finite time with an optimal solution. We include an extensive empirical analysis using well-known benchmarks, thereby demonstrating that our approach provides significant scalability improvements compared to the state of the art.