AITopics | RNN

Collaborating Authors

RNN

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

The Unreasonable Effectiveness of Recurrent Neural Networks

#artificialintelligenceSep-14-2017, 00:15:25 GMT

Moreover, as we'll see in a bit, RNNs combine the input vector with their state vector with a fixed (but learned) function to produce a new state vector. If training vanilla neural nets is optimization over functions, training recurrent nets is optimization over programs. At the core, RNNs have a deceptively simple API: They accept an input vector x and give you an output vector y. Written as a class, the RNN's API consists of a single step function: The RNN class has some internal state that it gets to update every time step is called.

deep learning, neural network, RNN, (19 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Neural Language Modeling From Scratch (Part 1)

#artificialintelligenceSep-11-2017, 17:30:27 GMT

The decoder is a simple function that takes a representation of the input word and returns a distribution which represents the model's predictions for the next word: the model assigns to each word the probability that it will be the next word in the sequence. This model is similar to the simple one, just that after encoding the current input word we feed the resulting representation (of size 200) into a two layer LSTM, which then outputs a vector also of size 200 (at every time step the LSTM also receives a vector representing its previous state- this is not shown in the diagram). In the input embedding, words that have similar meanings are represented by similar vectors (similar in terms of cosine similarity). Because the model would like to, given the RNN output, assign similar probability values to similar words, similar words are represented by similar vectors.

deep learning, neural network, probability, (20 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

2dn4Dzq

#artificialintelligenceSep-29-2016, 06:15:27 GMT

In blue we show the recurrent connections – the output'm' at time (t – 1) is fed back to the memory at time't' via the three gates; the cell value is fed back via the forget gate; the predicted word at time (t – 1) is fed back in addition to the memory output'm' at time't' into the Softmax for tag prediction. In spite of this fact, when we test images with multiple clothing type, our trained model generates tags for these unseen test images quite accurately ( 80% accurate). Prediction accuracy of our model improves quickly with increasing number of training iterations and stabilizes after about 20,000 iterations. Moreover, combining DCNN-RNN model helps us extend the trained model to solve completely different problem like fashion image tag generation.

deep learning, neural network, prediction, (21 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Deep Learning at x.ai - x.ai

#artificialintelligenceMar-25-2016, 13:26:34 GMT

When a RNN is trained on sequences of words, it learns to represent each word as a high dimensional vector which encodes the model's understanding of that word. If you take a step back and view the image as a whole, the large scale structure of the image is determined by words' part of speech. Nouns tend to lie in the center of the image, verbs tend to lie on the upper right side, and first names form a large orange cluster in the bottom left part of the image. The RNN learned all of this semantic understanding without a human ever having to code a definition of concepts like nouns, verbs, universities, cities, meetings, or social media.

CROWDSOURCING, deep learning, neural network, (19 more...)

#artificialintelligence

Industry: Information Technology > Services (0.78)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.99)

Add feedback