Review for NeurIPS paper: Untangling tradeoffs between recurrence and self-attention in artificial neural networks

Feb-7-2025, 08:20:00 GMT–Neural Information Processing Systems

The paper provides a theoretical analysis of self-attention and vanishing gradients. The experiments are limited to toy problems and do not reach state-of-the-art results, but they validate the paper's main theoretical contributions.

artificial neural network, recurrence and self-attention, untangling tradeoff

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.85)