Goto

Collaborating Authors

 Deep Learning


Google open-sources TensorFlow training tools to ease machine learning

#artificialintelligence

Over the past year, Google's TensorFlow has asserted itself as a popular open source toolkit for deep learning. But training a TensorFlow model can be cumbersome and slow--especially when the mission is to take a dataset used by someone else and try to refine the training process it uses. The sheer number of moving parts and variations in any model-training process is enough to make even deep-learning experts take a deep breath. This week, Google open-sourced a project intended to cut down on the amount of work in configuring a deep learning model for training. Tensor2Tensor, or T2T for short, is a Python-powered workflow organization library for TensorFlow training jobs. It lets developers specify the key elements used in a TensorFlow model and define the relationships among them.


Neuroevolution: A different kind of deep learning

#artificialintelligence

Early price ends August 4. Neuroevolution is making a comeback. Prominent artificial intelligence labs and researchers are experimenting with it, a string of new successes have bolstered enthusiasm, and new opportunities for impact in deep learning are emerging. Maybe you haven't heard of neuroevolution in the midst of all the excitement over deep learning, but it's been lurking just below the surface, the subject of study for a small, enthusiastic research community for decades. And it's starting to gain more attention as people recognize its potential. Put simply, neuroevolution is a subfield within artificial intelligence (AI) and machine learning (ML) that consists of trying to trigger an evolutionary process similar to the one that produced our brains, except inside a computer. In other words, neuroevolution seeks to develop the means of evolving neural networks through evolutionary algorithms. When I first waded into AI research in the late 1990s, the idea that brains could be evolved inside computers resonated with my sense of adventure. At that time, it was an unusual, even obscure field, but I felt a deep curiosity and affinity. The result has been 20 years of my life thinking about this subject, and a slew of algorithms developed with outstanding colleagues over the years, such as NEAT, HyperNEAT, and novelty search. In this article, I hope to convey some of the excitement of neuroevolution as well as provide insight into its issues, but without the opaque technical jargon of scientific articles. I have also taken, in part, an autobiographical perspective, reflecting my own deep involvement within the field.


Soon We Won't Program Computers. We'll Train Them Like Dogs

#artificialintelligence

Before the invention of the computer, most experimental psychologists thought the brain was an unknowable black box. You could analyze a subject's behavior--ring bell, dog salivates--but thoughts, memories, emotions? That stuff was obscure and inscrutable, beyond the reach of science. So these behaviorists, as they called themselves, confined their work to the study of stimulus and response, feedback and reinforcement, bells and saliva. They gave up trying to understand the inner workings of the mind. They ruled their field for four decades. Then, in the mid-1950s, a group of rebellious psychologists, linguists, information theorists, and early artificial-intelligence researchers came up with a different conception of the mind.


[P] ML Algorithm to Replace Controller โ€ข r/MachineLearning

@machinelearnbot

I'd like to use an algorithm uController to replace the control unit of a system. The controller is a PI controller for a power system. The objective is to optimize the response. Can a deep learning algorithm do this?


Perturbation Training for Human-Robot Teams

Journal of Artificial Intelligence Research

In this work, we design and evaluate a computational learning model that enables a human-robot team to co-develop joint strategies for performing novel tasks that require coordination. The joint strategies are learned through "perturbation training," a human team-training strategy that requires team members to practice variations of a given task to help their team generalize to new variants of that task. We formally define the problem of human-robot perturbation training and develop and evaluate the first end-to-end framework for such training, which incorporates a multi-agent transfer learning algorithm, human-robot co-learning framework and communication protocol. Our transfer learning algorithm, Adaptive Perturbation Training (AdaPT), is a hybrid of transfer and reinforcement learning techniques that learns quickly and robustly for new task variants. We empirically validate the benefits of AdaPT through comparison to other hybrid reinforcement and transfer learning techniques aimed at transferring knowledge from multiple source tasks to a single target task. We also demonstrate that AdaPT's rapid learning supports live interaction between a person and a robot, during which the human-robot team trains to achieve a high level of performance for new task variants. We augment AdaPT with a co-learning framework and a computational bi-directional communication protocol so that the robot can co-train with a person during live interaction. Results from large-scale human subject experiments (n=48) indicate that AdaPT enables an agent to learn in a manner compatible with a human's own learning process, and that a robot undergoing perturbation training with a human results in a high level of team performance. Finally, we demonstrate that human-robot training using AdaPT in a simulation environment produces effective performance for a team incorporating an embodied robot partner.


Reporting Score Distributions Makes a Difference: Performance Study of LSTM-networks for Sequence Tagging

arXiv.org Machine Learning

In this paper we show that reporting a single performance score is insufficient to compare non-deterministic approaches. We demonstrate for common sequence tagging tasks that the seed value for the random number generator can result in statistically significant (p < 10^-4) differences for state-of-the-art systems. For two recent systems for NER, we observe an absolute difference of one percentage point F1-score depending on the selected seed value, making these systems perceived either as state-of-the-art or mediocre. Instead of publishing and reporting single performance scores, we propose to compare score distributions based on multiple executions. Based on the evaluation of 50.000 LSTM-networks for five sequence tagging tasks, we present network architectures that produce both superior performance as well as are more stable with respect to the remaining hyperparameters.


Bayesian Sparsification of Recurrent Neural Networks

arXiv.org Machine Learning

Recurrent neural networks show state-of-the-art results in many text analysis tasks but often require a lot of memory to store their weights. Recently proposed Sparse V ariational Dropout (Molchanov et al., 2017) eliminates the majority of the weights in a feed-forward neural network without significant loss of quality. We apply this technique to sparsify recurrent neural networks. To account for recurrent specifics we also rely on Binary V ariational Dropout for RNN (Gal & Ghahramani, 2016b). We report 99.5% sparsity level on sentiment analysis task without a quality drop and up to 87% sparsity level on language modeling task with slight loss of accuracy.


Universal Adversarial Perturbations Against Semantic Image Segmentation

arXiv.org Machine Learning

While deep learning is remarkably successful on perceptual tasks, it was also shown to be vulnerable to adversarial perturbations of the input. These perturbations denote noise added to the input that was generated specifically to fool the system while being quasi-imperceptible for humans. More severely, there even exist universal perturbations that are input-agnostic but fool the network on the majority of inputs. While recent work has focused on image classification, this work proposes attacks against semantic image segmentation: we present an approach for generating (universal) adversarial perturbations that make the network yield a desired target segmentation as output. We show empirically that there exist barely perceptible universal noise patterns which result in nearly the same predicted segmentation for arbitrary inputs. Furthermore, we also show the existence of universal noise which removes a target class (e.g., all pedestrians) from the segmentation while leaving the segmentation mostly unchanged otherwise.


Open Source Stories: The People Behind OpenAI

#artificialintelligence

You might think, based on the type of research they're doing, that the OpenAI office would be full of gadgets, full of wonder, full of weird experiments. There are no Faraday cages. Well, okay, there is a robot. And it's tucked away in a side room. It's surrounded by cobbled-together protective material so that it doesn't smash into itself if it starts flailing about due to a programming error.


Google Launches Free Course on Deep Learning: The Science of Teaching Computers How to Teach Themselves

#artificialintelligence

Last Friday, we mentioned how Google's artificial intelligence software DeepMind has the ability to teach itself many things. It can teach itself how to walk, jump and run. Or defeat the world's best player of the Chinese strategy game, Go. The science of teaching computers how to do things is called Deep Learning. Offered through Udacity, the course is taught by Vincent Vanhoucke, the technical lead in Google's Brain team.