AITopics | Deep Learning

Collaborating Authors

Deep Learning

New computational algorithms make it possible to build neural networks with many input nodes and many layers, and distinguish "deep learning" of these networks from previous work on artificial neural nets.

News Overviews Instructional Materials AI-Alerts Classics

maciejkula/spotlight

#artificialintelligenceSep-9-2017, 03:00:55 GMT

Large embedding layers are a performance problem for fitting models: even though the gradients are sparse (only a handful of user and item vectors need parameter updates in every minibatch), PyTorch updates the entire embedding layer at every backward pass. Computation time is then wasted on applying zero gradient steps to whole embedding matrix. To alleviate this problem, we can use a smaller underlying embedding layer, and probabilistically hash users and items into that smaller space. With good hash functions, collisions should be rare, and we should observe fitting speedups without a decrease in accuracy. The implementation in Spotlight follows the RecSys 2017 paper "Getting deep recommenders fit: Bloom embeddings for sparse binary input/output networks.".

artificial intelligence, implementation, machine learning, (19 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback

What Are The Differences Between AI, Machine Learning, NLP, And Deep Learning?

#artificialintelligenceSep-9-2017, 01:20:28 GMT

What is the difference between AI, Machine Learning, NLP, and Deep Learning? AI (Artificial intelligence) is a subfield of computer science that was created in the 1960s, and it was/is concerned with solving tasks that are easy for humans but hard for computers. In particular, a so-called Strong AI would be a system that can do anything a human can (perhaps without purely physical things). This is fairly generic and includes all kinds of tasks such as planning, moving around in the world, recognizing objects and sounds, speaking, translating, performing social or business transactions, creative work (making art or poetry), etc. NLP (Natural language processing) is simply the part of AI that has to do with language (usually written). Machine learning is concerned with one aspect of this: given some AI problem that can be described in discrete terms (e.g.

artificial intelligence, deep learning, machine learning, (4 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.64)

Add feedback

Classifying Unordered Feature Sets with Convolutional Deep Averaging Networks

Gardner, Andrew, Kanno, Jinko, Duncan, Christian A., Selmic, Rastko R.

arXiv.org Machine LearningSep-9-2017

We propose convolutional deep averaging networks (CDANs) for classifying and learning feature representations of datasets containing instances with unordered features, where each feature is considered a tuple composed of one or more values. CDANs accept variable-size input and are invariant to permutations of the input's order. In addition, as a side-effect of the training process, CDANs learn discriminative, nonlinear embeddings of individual input elements into a space of chosen dimensionality. Contrary to their name, which is inspired by the work of Iyyer et al. [11], CDANs could perhaps be more accurately termed convolutional deep pooling networks as we also consider the effects of functions other than averaging such as taking element-wise maximums or sums. A. Contributions We propose CDANs for classifying unordered feature sets. We show that a CDAN with nonlinear embeddings is competitive with and perhaps even superior to recurrent neural networks (RNNs) and known permutation-invariant architectures for classifying instances containing variablesize sets of unordered features. We also find that the type of pooling plays a significant role in determining the efficacy of the network with sum-pooling clearly outperforming maxand average-pooling.

architecture, neural network, nonlinear, (14 more...)

arXiv.org Machine Learning

1709.03019

Country:

North America > United States > Louisiana > Lincoln Parish > Ruston (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
North America > United States > Connecticut > New Haven County > Hamden (0.04)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report (0.51)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

Add feedback

Complex spectrogram enhancement by convolutional neural network with multi-metrics learning

Fu, Szu-Wei, Hu, Ting-yao, Tsao, Yu, Lu, Xugang

arXiv.org Machine LearningSep-9-2017

This paper aims to address two issues existing in the current speech enhancement methods: 1) the difficulty of phase estimations; 2) a single objective function cannot consider multiple metrics simultaneously. To solve the first problem, we propose a novel convolutional neural network (CNN) model for complex spectrogram enhancement, namely estimating clean real and imaginary (RI) spectrograms from noisy ones. The reconstructed RI spectrograms are directly used to synthesize enhanced speech waveforms. In addition, since log-power spectrogram (LPS) can be represented as a function of RI spectrograms, its reconstruction is also considered as another target. Thus a unified objective function, which combines these two targets (reconstruction of RI spectrograms and LPS), is equivalent to simultaneously optimizing two commonly used objective metrics: segmental signal-to-noise ratio (SSNR) and logspectral distortion (LSD). Therefore, the learning process is called multi-metrics learning (MML). Experimental results confirm the effectiveness of the proposed CNN with RI spectrograms and MML in terms of improved standardized evaluation metrics on a speech enhancement task.

artificial intelligence, machine learning, spectrogram, (18 more...)

arXiv.org Machine Learning

1704.08504

Country:

Asia (0.69)
North America > United States (0.46)

Genre: Research Report > New Finding (0.66)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Deep Residual Networks and Weight Initialization

Taki, Masato

arXiv.org Machine LearningSep-9-2017

Residual Network (ResNet) is the state-of-the-art architecture that realizes successful training of really deep neural network. It is also known that good weight initialization of neural network avoids problem of vanishing/exploding gradients. In this paper, simplified models of ResNets are analyzed. We argue that goodness of ResNet is correlated with the fact that ResNets are relatively insensitive to choice of initial weights. We also demonstrate how batch normalization improves backpropagation of deep ResNets without tuning initial values of weights.

artificial intelligence, machine learning, resnet, (18 more...)

arXiv.org Machine Learning

1709.02956

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

tensorflow/agents

@machinelearnbotSep-8-2017, 23:46:27 GMT

This project provides optimized infrastructure for reinforcement learning. It extends the OpenAI gym interface to multiple parallel environments and allows agents to be implemented in TensorFlow and perform batched computation. As a starting point, we provide BatchPPO, an optimized implementation of Proximal Policy Optimization. The algorithm to use is defined in the configuration and pendulum started here uses the included PPO implementation. Check out more pre-defined configurations in agents/scripts/configs.py.

large language model, machine learning, reinforcement learning, (8 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.35)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.31)

Add feedback

Cognitive Toolkit Model Evaluation in UWP - Building Apps for Windows

#artificialintelligenceSep-8-2017, 23:36:20 GMT

We are excited to share with you that Microsoft Cognitive Toolkit (CNTK) 2.1 has added support for model evaluation on UWP applications. This means you can harness the power of deep learning in your Windows apps delivered via the Windows Store! Read on to find out how can infuse your apps with the power of AI. Cloud-connected devices can perform operations locally or delegate them to the cloud. The virtually unlimited compute power of the cloud makes it a good choice for running tasks that need significant compute power but don't require low latency.

artificial intelligence, cognitive toolkit model evaluation, machine learning, (11 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Top /r/MachineLearning Posts, August: Andrew Ng is back at it; Reinforcement Learning makes a splash; Fixing your ANN

#artificialintelligenceSep-8-2017, 13:20:31 GMT

No doubt you have heard about it by now. Above is the link to the Reddit discussion, while this is the link to the Coursera specialization. So much to study, so little time!! Testing our agents in games that are not specifically designed for AI research, and where humans play well, is crucial to benchmark agent performance. That is why we, along with our partner Blizzard Entertainment, are excited to announce the release of SC2LE, a set of tools that we hope will accelerate AI research in the real-time strategy game StarCraft II. This includes an API for machine learning which hooks into a given game, a dataset of anonymized game replays (increasing to 500K in the coming weeks), and an open source version of PySC2, DeepMind's toolset.

artificial intelligence, machine learning, reinforcement learning make, (13 more...)

#artificialintelligence

Industry: Leisure & Entertainment > Games > Computer Games (0.96)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.60)

Add feedback

A practical guide to machine learning in business

@machinelearnbotSep-8-2017, 11:55:12 GMT

Machine learning is transforming business. But even as the technology advances, companies still struggle to take advantage of it, largely because they don't understand how to strategically implement machine learning in service of business goals. Hype hasn't helped, sowing confusion over what exactly machine learning is, how well it works and what it can do for your company. Here, we provide a clear-eyed look at what machine learning is and how it can be used today. Machine learning is a subset of artificial intelligence that enables systems to learn and predict outcomes without explicit programming.

artificial intelligence, machine learning, neural network, (16 more...)

@machinelearnbot

Country: Oceania > Australia (0.05)

Industry: Information Technology (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback