AITopics | Deep Learning

Collaborating Authors

Deep Learning

New computational algorithms make it possible to build neural networks with many input nodes and many layers, and distinguish "deep learning" of these networks from previous work on artificial neural nets.

News Overviews Instructional Materials AI-Alerts Classics

Stochastic Training of Neural Networks via Successive Convex Approximations

Scardapane, Simone, Di Lorenzo, Paolo

arXiv.org Machine LearningJun-15-2017

This paper proposes a new family of algorithms for training neural networks (NNs). These are based on recent developments in the field of non-convex optimization, going under the general name of successive convex approximation (SCA) techniques. The basic idea is to iteratively replace the original (non-convex, highly dimensional) learning problem with a sequence of (strongly convex) approximations, which are both accurate and simple to optimize. Differently from similar ideas (e.g., quasi-Newton algorithms), the approximations can be constructed using only first-order information of the neural network function, in a stochastic fashion, while exploiting the overall structure of the learning problem for a faster convergence. We discuss several use cases, based on different choices for the loss function (e.g., squared loss and cross-entropy loss), and for the regularization of the NN's weights. We experiment on several medium-sized benchmark problems, and on a large-scale dataset involving simulated physical data. The results show how the algorithm outperforms state-of-the-art techniques, providing faster convergence to a better minimum. Additionally, we show how the algorithm can be easily parallelized over multiple computational units without hindering its performance. In particular, each computational unit can optimize a tailored surrogate function defined on a randomly assigned subset of the input variables, whose dimension can be selected depending entirely on the available computational power.

algorithm, artificial intelligence, machine learning, (12 more...)

arXiv.org Machine Learning

1706.04769

Country: Europe (0.28)

Genre:

Research Report > New Finding (0.34)
Research Report > Experimental Study (0.34)
Research Report > Promising Solution (0.34)

Industry: Education > Focused Education > Special Education (0.44)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Deep adversarial neural decoding

Güçlütürk, Yağmur, Güçlü, Umut, Seeliger, Katja, Bosch, Sander, van Lier, Rob, van Gerven, Marcel

arXiv.org Machine LearningJun-15-2017

Here, we present a novel approach to solve the problem of reconstructing perceived stimuli from brain responses by combining probabilistic inference with deep learning. Our approach first inverts the linear transformation from latent features to brain responses with maximum a posteriori estimation and then inverts the nonlinear transformation from perceived stimuli to latent features with adversarial training of convolutional neural networks. We test our approach with a functional magnetic resonance imaging experiment and show that it can generate state-of-the-art reconstructions of perceived faces from brain activations.

artificial intelligence, machine learning, reconstruction, (14 more...)

arXiv.org Machine Learning

1705.07109

Country: Europe > Netherlands (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.69)

Industry:

Health & Medicine > Therapeutic Area > Neurology (0.97)
Health & Medicine > Diagnostic Medicine > Imaging (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Deep Clustering and Conventional Networks for Music Separation: Stronger Together

Luo, Yi, Chen, Zhuo, Hershey, John R., Roux, Jonathan Le, Mesgarani, Nima

arXiv.org Machine LearningJun-15-2017

Deep clustering is the first method to handle general audio separation scenarios with multiple sources of the same type and an arbitrary number of sources, performing impressively in speaker-independent speech separation tasks. However, little is known about its effectiveness in other challenging situations such as music source separation. Contrary to conventional networks that directly estimate the source signals, deep clustering generates an embedding for each time-frequency bin, and separates sources by clustering the bins in the embedding space. We show that deep clustering outperforms conventional networks on a singing voice separation task, in both matched and mismatched conditions, even though conventional networks have the advantage of end-to-end training for best signal approximation, presumably because its more flexible objective engenders better regularization. Since the strengths of deep clustering and conventional network architectures appear complementary, we explore combining them in a single hybrid network trained via an approach akin to multi-task learning. Remarkably, the combination significantly outperforms either of its components.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Machine Learning

doi: 10.1109/ICASSP.2017.7952118

1611.06265

Country: North America > United States (0.46)

Genre: Research Report (1.00)

Industry: Automobiles & Trucks (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

A Simple and Accurate Syntax-Agnostic Neural Model for Dependency-based Semantic Role Labeling

Marcheggiani, Diego, Frolov, Anton, Titov, Ivan

arXiv.org Artificial IntelligenceJun-15-2017

We introduce a simple and accurate neural model for dependency-based semantic role labeling. Our model predicts predicate-argument dependencies relying on states of a bidirectional LSTM encoder. The semantic role labeler achieves competitive performance on English, even without any kind of syntactic information and only using local inference. However, when automatically predicted part-of-speech tags are provided as input, it substantially outperforms all previous local models and approaches the best reported results on the English CoNLL-2009 dataset. We also consider Chinese, Czech and Spanish where our approach also achieves competitive results. Syntactic parsers are unreliable on out-of-domain data, so standard (i.e., syntactically-informed) SRL models are hindered when tested in this setting. Our syntax-agnostic model appears more robust, resulting in the best reported results on standard out-of-domain test sets.

machine learning, natural language, predicate, (19 more...)

arXiv.org Artificial Intelligence

1701.02593

Country: North America > United States (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Deep Learning and Remote Sensing. – Geo.Appsmith

#artificialintelligenceJun-14-2017, 22:50:13 GMT

Watch the video below to understand smart processing of satellite imagery using deep learning.

artificial intelligence, deep learning and remote sensing, machine learning, (2 more...)

#artificialintelligence

Industry: Energy > Renewable > Geothermal > Geothermal Energy Exploration and Development > Geophysical Analysis & Survey (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Getting Started with Deep Learning

@machinelearnbotJun-14-2017, 21:45:14 GMT

This article was written by Matthew Rubashkin. With a background in optical physics and biomedical research, Matthew has a broad range of experiences in software development, database engineering, and data analytics. At SVDS, our R&D team has been investigating different deep learning technologies, from recognizing images of trains to speech recognition. We needed to build a pipeline for ingesting data, creating a model, and evaluating the model performance. However, when we researched what technologies were available, we could not find a concise summary document to reference for starting a new deep learning project.

artificial intelligence, deep learning, machine learning, (1 more...)

@machinelearnbot

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

This robot uses deep learning to write and play its own music

#artificialintelligenceJun-14-2017, 21:05:29 GMT

Artificial intelligence has proved itself incredibly capable of analysing images, now its getting rhythm in the form of a four-armed, marimba-playing robot. The robot, named Shimon, was given a vast amount of musical data: more than 5,000 complete songs, two million motifs, riffs and short passages of music by researchers at Georgia Institute of Technology. It was then asked to compose and perform its own music. It's been in development for some years, but this is the first time it has composed its own music. Once it had been fed the data it was able to use deep learning techniques to create two 30 second pieces of original music.

artificial intelligence, machine learning, music, (7 more...)

#artificialintelligence

Industry:

Media > Music (0.34)
Leisure & Entertainment (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.99)

Add feedback

Drilling Down into Machine Learning and Deep Learning

#artificialintelligenceJun-14-2017, 21:05:28 GMT

Deep learning in turn, is subclass of machine learning that creates machines that use methods originally inspired by how a cat's brain reacted with light signals and then generalized to mimic the human brain's ability to learn. Until recently, we simply didn't have enough data and proces- sing power to train a machine to learn. Deep neural networks (DNNs) learn at many levels of abstraction, ranging from simple concepts to complex ones. This is what designates the "deep" in deep learning. Each layer in the neural network categorizes some kind of information, refines it, and passes it along to the next layer.

artificial intelligence, drilling, machine learning and deep learning, (5 more...)

#artificialintelligence

Country: North America > Canada > Quebec > Montreal (0.07)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

ElementAI raises historic $137.5 million Series A round

#artificialintelligenceJun-14-2017, 21:05:14 GMT

Element AI, the Montreal-based artificial intelligence (AI) powerhouse, today announced it has raised $137.5 million ($102 million USD) in Series A funding, the largest in history for an AI company. The round was led by Data Collective (DCVC) with further investments from Real Ventures, Development Bank of Canada (BDC), Fidelity Investments Canada, Hanwha Investment, Intel Capital, Microsoft Ventures, National Bank of Canada, NVIDIA, Tencent, and several of the world's largest sovereign wealth funds. READ ALSO: Yoshua Bengio and friends launch'AI startup factory' The funding will allow Element AI to invest in large-scale AI projects internationally, solidifying its position as the largest global AI company in Canada and creating 250 jobs in the Canadian high tech sector by January 2018. Co-founded by serial entrepreneurs Jean-François Gagné and Nicolas Chapados, Real Ventures and Yoshua Bengio, a co-father of deep learning technology, Element AI aims to bring academic AI innovation to global organizations. Started in October 2016 to empower industry with the massive scale of academic AI innovation Bengio was driving at the world-leading Montreal Institute of Learning Algorithms (MILA), the two groups pioneered a unique, non-exploitative model of academic cooperation they have since replicated at many other institutes.

artificial intelligence, element ai, machine learning, (9 more...)

#artificialintelligence

Country: North America > Canada > Quebec > Montreal (0.78)

Industry: Banking & Finance > Capital Markets (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.57)

Add feedback

Forget AlphaGo, DeepMind has a more interesting step toward general AI

#artificialintelligenceJun-14-2017, 21:05:07 GMT

AlphaGo and self-driving cars are amazingly clever, but neither represents a very big leap toward general artificial intelligence. Fortunately, some AI researchers are developing ways of broadening machine intelligence. The researchers at DeepMind, which created the champion Go-playing robot AlphaGo, are working on an approach that could prove significant in the quest to make machines as intelligent as we are. In two papers published this week and reported by New Scientist, researchers at the Alphabet subsidiary describe efforts to teach computers about relational reasoning, a cognitive capability that is foundational to human intelligence. Simply put, relational reasoning is the ability to consider relationships between different mental representations, such as objects, words, or ideas.

artificial intelligence, deep learning, machine learning, (8 more...)

#artificialintelligence

Industry:

Leisure & Entertainment > Games > Go (1.00)
Information Technology > Software (0.84)

Technology:

Information Technology > Artificial Intelligence > Games > Go (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.77)

Add feedback