Goto

Collaborating Authors

 Deep Learning


An Introduction to different Types of Convolutions in Deep Learning

@machinelearnbot

Let me give you a quick overview of different types of convolutions and what their benefits are. First we need to agree on a few parameters that define a convolutional layer. Dilated convolutions introduce another parameter to convolutional layers called the dilation rate. This defines a spacing between the values in a kernel. A 3x3 kernel with a dilation rate of 2 will have the same field of view as a 5x5 kernel, while only using 9 parameters.


Time Series Prediction Using Recurrent Neural Networks (LSTMs) - DZone AI

#artificialintelligence

After training this model for 200 epochs or early_callbacks (whichever came first), the model tries to learn the pattern and the behavior of the data. Since we split the data into training and testing sets, we can now predict the value of testing data and compare them with the ground truth.


Four deep learning trends from ACL 2017

#artificialintelligence

This is the second of a two-part post in which I describe four broad research trends that I observed at ACL 2017. In Part One I explored the shifting assumptions we make about language, both at the sentence and the word level, and how these shifts are prompting both a comeback of linguistic structure and a re-evaluation of word embeddings. In this part, I will discuss two more very inter-related themes: interpretability and attention. Throughout, green links are ordinary hyperlinks, while blue links lead to papers, and offer bibliographic information when you hover over them (not supported on mobile). I've been thinking about interpretability a lot recently, and I'm not alone โ€“ among deep learning practitioners, the dreaded "black box" quality of neural networks makes them notoriously hard to control, hard to debug and thus hard to develop.


The Unreasonable Effectiveness of Recurrent Neural Networks

#artificialintelligence

Moreover, as we'll see in a bit, RNNs combine the input vector with their state vector with a fixed (but learned) function to produce a new state vector. If training vanilla neural nets is optimization over functions, training recurrent nets is optimization over programs. At the core, RNNs have a deceptively simple API: They accept an input vector x and give you an output vector y. Written as a class, the RNN's API consists of a single step function: The RNN class has some internal state that it gets to update every time step is called.


The Conditional Analogy GAN: Swapping Fashion Articles on People Images

arXiv.org Machine Learning

We present a novel method to solve image analogy problems : it allows to learn the relation between paired images present in training data, and then generalize and generate images that correspond to the relation, but were never seen in the training set. Therefore, we call the method Conditional Analogy Generative Adversarial Network (CAGAN), as it is based on adversarial training and employs deep convolutional neural networks. An especially interesting application of that technique is automatic swapping of clothing on fashion model photos. Our work has the following contributions. First, the definition of the end-to-end trainable CAGAN architecture, which implicitly learns segmentation masks without expensive supervised labeling data. Second, experimental results show plausible segmentation masks and often convincing swapped images, given the target article. Finally, we discuss the next steps for that technique: neural network architecture improvements and more advanced applications.


Deep Models Under the GAN: Information Leakage from Collaborative Deep Learning

arXiv.org Machine Learning

Deep Learning has recently become hugely popular in machine learning, providing significant improvements in classification accuracy in the presence of highly-structured and large databases. Researchers have also considered privacy implications of deep learning. Models are typically trained in a centralized manner with all the data being processed by the same training algorithm. If the data is a collection of users' private data, including habits, personal pictures, geographical positions, interests, and more, the centralized server will have access to sensitive information that could potentially be mishandled. To tackle this problem, collaborative deep learning models have recently been proposed where parties locally train their deep learning structures and only share a subset of the parameters in the attempt to keep their respective training sets private. Parameters can also be obfuscated via differential privacy (DP) to make information extraction even more challenging, as proposed by Shokri and Shmatikov at CCS'15. Unfortunately, we show that any privacy-preserving collaborative deep learning is susceptible to a powerful attack that we devise in this paper. In particular, we show that a distributed, federated, or decentralized deep learning approach is fundamentally broken and does not protect the training sets of honest participants. The attack we developed exploits the real-time nature of the learning process that allows the adversary to train a Generative Adversarial Network (GAN) that generates prototypical samples of the targeted training set that was meant to be private (the samples generated by the GAN are intended to come from the same distribution as the training data). Interestingly, we show that record-level DP applied to the shared parameters of the model, as suggested in previous work, is ineffective (i.e., record-level DP is not designed to address our attack).


Emotional Chatting Machine: Emotional Conversation Generation with Internal and External Memory

arXiv.org Artificial Intelligence

Perception and expression of emotion are key factors to the success of dialogue systems or conversational agents. However, this problem has not been studied in large-scale conversation generation so far. In this paper, we propose Emotional Chatting Machine (ECM) that can generate appropriate responses not only in content (relevant and grammatical) but also in emotion (emotionally consistent). To the best of our knowledge, this is the first work that addresses the emotion factor in large-scale conversation generation. ECM addresses the factor using three new mechanisms that respectively (1) models the high-level abstraction of emotion expressions by embedding emotion categories, (2) captures the change of implicit internal emotion states, and (3) uses explicit emotion expressions with an external emotion vocabulary. Experiments show that the proposed model can generate responses appropriate not only in content but also in emotion.


New-Age Machine Learning Algorithms in Retail Lending

@machinelearnbot

More than a decade back while joining a large US Credit Cards company, it was surprising to see that Predictive Analytics was limited to multivariate regression and logistic models. This was in contrast to previous stints at Start-Ups funded by NASA / NIST where a broader set of Machine Learning (ML) methods including SVMs, NNs, Random or Gradient Boosting Trees were regularly applied. There were a number of good reasons for using the simpler models in Retail Lending. Firstly, Decision Frameworks were already in place that made input feature selection a relatively simpler exercise. For e.g., for Credit Decisioning, one could think in terms of 5Cs of Credit (Character, Capacity, Capital, Collateral, Conditions), and search for Data variables that catered to them.


Evolving Government: Why government needs open-source deep learning - Fedscoop

#artificialintelligence

Deep learning is cutting edge artificial intelligence. It's what Google used to build AlphaGo, which beat the world champion of board game Go earlier this year in China. And it's being used by many of the world's top tech companies as the basis for recommender systems, fraud detection and cybersecurity. Government should be using deep learning, because it is a sophisticated tool that can help agencies fulfill their mission for use cases as diverse as risk profiling, cost forecasting and the analysis of satellite imagery. An additional benefit that supports both recent governmentwide policy and tight budgets is that most of the best deep-learning algorithms are open source.


Inside Facebook's Quest for Software That Understands You

#artificialintelligence

The first time Yann LeCun revolutionized artificial intelligence, it was a false dawn. It was 1995, and for almost a decade, the young Frenchman had been dedicated to what many computer scientists considered a bad idea: that crudely mimicking certain features of the brain was the best way to bring about intelligent machines. But LeCun had shown that this approach could produce something strikingly smart--and useful. Working at Bell Labs, he made software that roughly simulated neurons and learned to read handwritten text by looking at many different examples. Bell Labs' corporate parent, AT&T, used it to sell the first machines capable of reading the handwriting on checks and written forms. To LeCun and a few fellow believers in artificial neural networks, it seemed to mark the beginning of an era in which machines could learn many other skills previously limited to humans. "This whole project kind of disappeared on the day of its biggest success," says LeCun. On the same day he celebrated the launch of bank machines that could read thousands of checks per hour, AT&T announced it was splitting into three companies dedicated to different markets in communications and computing. LeCun became head of research at a slimmer AT&T and was directed to work on other things; in 2002 he would leave AT&T, soon to become a professor at New York University.