Reviews: LightRNN: Memory and Computation-Efficient Recurrent Neural Networks

Jan-20-2025, 19:06:30 GMT–Neural Information Processing Systems

This work provides a novel and effective way to reduce the number of parameters for models that require handling of large vocabularies. The large drop in model size by several orders of magnitude could effectively allow some large models to be ported to the phone, which may not have been possible previously. I find it really interesting that a single method can improve both input parameter size and output size whereas previous work on softmaxes have only tackled the output side. However, I find that some technical details are lacking and the description can be confusing in some places. In particular, I find figure 2 and the unnumbered equation after Eq 1 confusing.

computation-efficient recurrent neural network, lightrnn, review, (1 more...)

Neural Information Processing Systems

Jan-20-2025, 19:06:30 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)