AITopics | Deep Learning

Deep Learning

New computational algorithms make it possible to build neural networks with many input nodes and many layers, and distinguish "deep learning" of these networks from previous work on artificial neural nets.

Elon Musk's lab forced bots to create their own language

#artificialintelligence | Mar-20-2017, 01:20:10 GMT

Have you ever experienced the dread of overhearing two people, speaking a language you don't understand, begin laughing wildly? You just have to wonder what it is they're talking about, and if it's a joke at your expense. Heck, maybe you even check your teeth to make sure you aren't walking around with half of your lunchtime ham sandwich stuck to your gums. As Wired reports, researchers at OpenAI have made some huge strides in getting bots to communicate with each other, and without actually telling them how to do so. The group published a research paper earlier this week explaining exactly how they were able to accomplish the complex task, and it's all based on reinforcement learning.

machine learning, natural language, reinforcement learning, (6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.80)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.64)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

New AI Can Write and Rewrite Its Own Code to Increase Its Intelligence • WorldNews

#artificialintelligence | Mar-20-2017, 01:20:08 GMT

The old adage that practice makes perfect applies to machines as well, as many of today's artificially intelligent devices rely on repetition to learn. Deep-learning algorithms are designed to allow AI devices to glean knowledge from datasets and then apply what they've learned to concrete situations. For example, an AI system is fed data about how the sky is usually blue, which allows it to later recognize the sky in a series of images. Complex work can be accomplished using this method, but it certainly leaves something to be desired. For instance, could the same results be obtained by exposing deep-learning AI to fewer examples?

artificial intelligence, machine learning, write and rewrite, (14 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.79)

Learning from the Hindsight Plan -- Episodic MPC Improvement

Tamar, Aviv, Thomas, Garrett, Zhang, Tianhao, Levine, Sergey, Abbeel, Pieter

arXiv.org Artificial Intelligence | Mar-20-2017

Model predictive control (MPC) is a popular control method that has proved effective for robotics, among other fields. MPC performs re-planning at every time step. Re-planning is done with a limited horizon per computational and real-time constraints and often also for robustness to potential model errors. However, the limited horizon leads to suboptimal performance. In this work, we consider the iterative learning setting, where the same task can be repeated several times, and propose a policy improvement scheme for MPC. The main idea is that between executions we can, offline, run MPC with a longer horizon, resulting in a hindsight plan. To bring the next real-world execution closer to the hindsight plan, our approach learns to re-shape the original cost function with the goal of satisfying the following property: short horizon planning (as realistic during real executions) with respect to the shaped cost should result in mimicking the hindsight plan. This effectively consolidates long-term reasoning into the short-horizon planning. We empirically evaluate our approach in contact-rich manipulation tasks both in simulated and real environments, such as peg insertion by a real PR2 robot.
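The receding-horizon loop described in the abstract can be illustrated with a toy example. The sketch below is plain short-horizon MPC on a hypothetical 1-D integer system with a brute-force planner; it does not implement the paper's hindsight plan or cost shaping. The names `plan` and `run_mpc` and the squared-distance-plus-effort cost are assumptions made only for illustration.

```python
from itertools import product


def plan(state, goal, horizon, actions=(-1, 0, 1)):
    """Enumerate every action sequence of length `horizon` and return the cheapest."""
    best_cost, best_seq = None, None
    for seq in product(actions, repeat=horizon):
        x, cost = state, 0
        for u in seq:
            x += u                             # toy dynamics: x' = x + u
            cost += (x - goal) ** 2 + abs(u)   # assumed cost: distance plus effort
        if best_cost is None or cost < best_cost:
            best_cost, best_seq = cost, seq
    return best_seq


def run_mpc(start, goal, horizon, steps):
    """Re-plan at every time step and execute only the first planned action."""
    x = start
    trajectory = [x]
    for _ in range(steps):
        u = plan(x, goal, horizon)[0]
        x += u
        trajectory.append(x)
    return trajectory


print(run_mpc(start=0, goal=3, horizon=2, steps=5))
```

Because the planner only looks `horizon` steps ahead, its choices can differ from those of a longer-horizon plan on harder problems, which is the gap the abstract's approach aims to close.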

downstream oil & gas, mpc, neural network, (22 more...)

arXiv:1609.09001

Genre: Research Report (1.00)

Industry: Energy > Oil & Gas > Downstream (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)
(2 more...)

A Survey of Available Corpora for Building Data-Driven Dialogue Systems

Serban, Iulian Vlad, Lowe, Ryan, Henderson, Peter, Charlin, Laurent, Pineau, Joelle

arXiv.org Artificial Intelligence | Mar-20-2017

During the past decade, several areas of speech and language understanding have witnessed substantial breakthroughs from the use of data-driven models. In the area of dialogue systems, the trend is less obvious, and most practical systems are still built through significant engineering and expert knowledge. Nevertheless, several recent results suggest that data-driven approaches are feasible and quite promising. To facilitate research in this area, we have carried out a wide survey of publicly available datasets suitable for data-driven learning of dialogue systems. We discuss important characteristics of these datasets, how they can be used to learn diverse dialogue strategies, and their other potential uses. We also examine methods for transfer learning between datasets and the use of external knowledge. Finally, we discuss appropriate choice of evaluation metrics for the learning objective.

information retrieval, machine learning, reinforcement learning, (25 more...)

arXiv:1512.05742

Country:

North America > United States (1.00)
Europe (1.00)

Genre:

Overview (1.00)
Research Report > New Finding (0.34)

Industry:

Media > Television (1.00)
Media > Film (1.00)
Health & Medicine (1.00)
(5 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
(13 more...)

Deep Sets

Zaheer, Manzil, Kottur, Satwik, Ravanbakhsh, Siamak, Poczos, Barnabas, Salakhutdinov, Ruslan, Smola, Alexander

arXiv.org Machine Learning | Mar-20-2017

In this paper, we study the problem of designing objective functions for machine learning problems defined on finite sets. In contrast to traditional objective functions defined for machine learning problems operating on finite-dimensional vectors, the new objective functions we propose operate on finite sets and are invariant to permutations. Such problems are widespread, ranging from estimation of population statistics (poczos13aistats), via anomaly detection in piezometer data of embankment dams (Jung15Exploration), to cosmology (Ntampaka16Dynamical, Ravanbakhsh16ICML1). Our main theorem characterizes the permutation-invariant objective functions and provides a family of functions to which any permutation-invariant objective function must belong. This family of functions has a special structure which enables us to design a deep network architecture that can operate on sets and which can be deployed in a variety of scenarios, including both unsupervised and supervised learning tasks. We demonstrate the applicability of our method on population statistic estimation, point cloud classification, set expansion, and image tagging.
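Permutation invariance can be made concrete with a small sketch. It assumes the sum-pooling form rho(sum of phi(x)), which is commonly associated with this line of work; the specific `phi` and `rho` below are hypothetical hand-written functions that recover a population variance, whereas a deep network of this kind would learn both maps.

```python
from itertools import permutations


def phi(x):
    # Hypothetical per-element feature map: (count, value, value squared).
    return (1, x, x * x)


def rho(pooled):
    # Hypothetical readout: population variance from the pooled sums.
    n, s, sq = pooled
    mean = s / n
    return sq / n - mean * mean


def set_function(xs):
    # Sum-pool the per-element features, then apply the readout.
    pooled = (0, 0, 0)
    for x in xs:
        pooled = tuple(p + f for p, f in zip(pooled, phi(x)))
    return rho(pooled)


data = [3, 1, 4, 1, 5]
# Every ordering of the same elements yields the same pooled sums.
outputs = {set_function(list(p)) for p in permutations(data)}
print(len(outputs))
```

Since addition does not depend on the order of its terms, any function built this way gives the same answer for every ordering of the input set.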

artificial intelligence, data mining, machine learning, (18 more...)

arXiv:1703.06114

Country: North America > United States (0.67)

Genre: Research Report > New Finding (0.68)

Industry:

Leisure & Entertainment > Sports (0.46)
Education > Focused Education > Special Education (0.44)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Data Science > Data Mining > Anomaly Detection (0.68)

Learning to Generate Samples from Noise through Infusion Training

Bordes, Florian, Honari, Sina, Vincent, Pascal

arXiv.org Machine Learning | Mar-20-2017

In this work, we investigate a novel training procedure to learn a generative model as the transition operator of a Markov chain, such that, when applied repeatedly on an unstructured random noise sample, it will denoise it into a sample that matches the target distribution from the training set. The novel training procedure to learn this progressive denoising operation involves sampling from a slightly different chain than the model chain used for generation in the absence of a denoising target. In the training chain we infuse information from the training target example that we would like the chains to reach with a high probability. The thus learned transition operator is able to produce quality and varied samples in a small number of steps. Experiments show competitive results compared to the samples generated with a basic Generative Adversarial Net

artificial intelligence, infusion rate, machine learning, (16 more...)

arXiv:1703.06975

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > Canada > Quebec > Montreal (0.14)

Genre:

Research Report (0.50)
Workflow (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.35)

Value Iteration Networks

Tamar, Aviv, Wu, Yi, Thomas, Garrett, Levine, Sergey, Abbeel, Pieter

arXiv.org Artificial Intelligence | Mar-20-2017

We introduce the value iteration network (VIN): a fully differentiable neural network with a 'planning module' embedded within. VINs can learn to plan, and are suitable for predicting outcomes that involve planning-based reasoning, such as policies for reinforcement learning. Key to our approach is a novel differentiable approximation of the value-iteration algorithm, which can be represented as a convolutional neural network and trained end-to-end using standard backpropagation. We evaluate VIN-based policies on discrete and continuous path-planning domains, and on a natural-language-based search task. We show that by learning an explicit planning computation, VIN policies generalize better to new, unseen domains.
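For readers unfamiliar with the algorithm the abstract builds on, here is a minimal sketch of classical tabular value iteration. It is not the differentiable VIN module itself. The 1-D corridor, the reward-on-arrival convention, and the names `step`, `value_iteration`, and `greedy_policy` are all assumptions chosen for illustration.

```python
MOVES = (-1, 0, 1)  # left, stay, right


def step(state, move, n):
    # Deterministic transition, clamped to the corridor [0, n - 1].
    return min(max(state + move, 0), n - 1)


def value_iteration(rewards, gamma=0.9, iterations=50):
    """Repeat the Bellman backup V(s) = max over moves of r(s') + gamma * V(s')."""
    n = len(rewards)
    values = [0.0] * n
    for _ in range(iterations):
        values = [
            max(rewards[step(s, m, n)] + gamma * values[step(s, m, n)] for m in MOVES)
            for s in range(n)
        ]
    return values


def greedy_policy(values, rewards, gamma=0.9):
    """Pick, in each state, the move with the highest one-step lookahead value."""
    n = len(values)
    return [
        max(MOVES, key=lambda m: rewards[step(s, m, n)] + gamma * values[step(s, m, n)])
        for s in range(n)
    ]


rewards = [0, 0, 0, 0, 1]  # reward for arriving in the rightmost cell
values = value_iteration(rewards)
print(greedy_policy(values, rewards))
```

Each backup combines a local maximum over neighbouring cells with a weighted sum of rewards and values, which gives some intuition for why the abstract describes the computation as representable by a convolutional network.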

artificial intelligence, machine learning, reinforcement learning, (19 more...)

arXiv:1602.02867

Genre: Research Report > New Finding (0.68)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

DeepMind organises its AI researchers into 'strike teams' and 'frontiers'

#artificialintelligence | Mar-19-2017, 20:15:12 GMT

The company, which is on a mission to "solve intelligence," has hired some of the brightest minds in the world, including academics from Oxbridge and research scientists from firms like Facebook and Microsoft. Exactly how DeepMind's researchers work together has been something of a mystery, but the FT story sheds new light on the matter. Researchers at DeepMind are divided into four main groups, including a "neuroscience" group and a "frontiers" group, according to the report. The frontiers group is said to be full of physicists and mathematicians who are tasked with testing some of the most futuristic AI theories. "We've hired 250 of the world's best scientists, so obviously they're here to let their creativity run riot, and we try and create an environment that's perfect for that," DeepMind CEO Demis Hassabis told the FT.

large language model, machine learning, natural language, (11 more...)

Country: Europe > United Kingdom (0.18)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Trask on Twitter

#artificialintelligence | Mar-19-2017, 16:50:35 GMT

deep learning, social media, twitter, (3 more...)

Country: Europe > United Kingdom > England > Oxfordshire > Oxford (0.52)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Communications > Social Media (0.85)

Here's How Pharma Is Using AI Deep Learning To Cure Aging :: The Market Oracle ::

#artificialintelligence | Mar-19-2017, 16:50:15 GMT

BY PATRICK COX: In 2011, scientists made one of the most important discoveries in the history of AI development. They found that graphics processing units (GPUs) are far better at simulating biological learning than central processing units (CPUs). In retrospect, it seems obvious. Human brains are much more like GPUs than CPUs. Both brains and GPUs rely on parallel processing that simulates and predicts real world physics.

artificial intelligence, machine learning, market oracle, (15 more...)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
