AITopics | computation-efficient recurrent neural network

Collaborating Authors

computation-efficient recurrent neural network

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

LightRNN: Memory and Computation-Efficient Recurrent Neural Networks

Neural Information Processing SystemsNov-21-2025, 15:21:45 GMT

Recurrent neural networks (RNNs) have achieved state-of-the-art performances in many natural language processing tasks, such as language modeling and machine translation. However, when the vocabulary is large, the RNN model will become very big (e.g., possibly beyond the memory capacity of a GPU device) and its training will become very inefficient. In this work, we propose a novel technique to tackle this challenge. The key idea is to use 2-Component (2C) shared embedding for word representations. We allocate every word in the vocabulary into a table, each row of which is associated with a vector, and each column associated with another vector.

computation-efficient recurrent neural network, lightrnn, vector, (8 more...)

Neural Information Processing Systems

Genre: Research Report (0.38)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.63)

Add feedback

Reviews: LightRNN: Memory and Computation-Efficient Recurrent Neural Networks

Neural Information Processing SystemsJan-20-2025, 19:06:30 GMT

This work provides a novel and effective way to reduce the number of parameters for models that require handling of large vocabularies. The large drop in model size by several orders of magnitude could effectively allow some large models to be ported to the phone, which may not have been possible previously. I find it really interesting that a single method can improve both input parameter size and output size whereas previous work on softmaxes have only tackled the output side. However, I find that some technical details are lacking and the description can be confusing in some places. In particular, I find figure 2 and the unnumbered equation after Eq 1 confusing.

computation-efficient recurrent neural network, lightrnn, review, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback

LightRNN: Memory and Computation-Efficient Recurrent Neural Networks

Li, Xiang, Qin, Tao, Yang, Jian, Liu, Tie-Yan

Neural Information Processing SystemsFeb-14-2020, 15:56:58 GMT

computation-efficient recurrent neural network, lightrnn, vector, (6 more...)

Neural Information Processing Systems

Genre: Research Report (0.57)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.63)

Add feedback