SAC: Accelerating and Structuring Self-Attention via Sparse Adaptive Connection

Oct-11-2024, 07:30:39 GMT–Neural Information Processing Systems

While the self-attention mechanism has been widely used in a wide variety of tasks, it has the unfortunate property of a quadratic cost with respect to the input length, which makes it difficult to deal with long inputs. In this paper, we present a method for accelerating and structuring self-attentions: Sparse Adaptive Connection (SAC). In SAC, we regard the input sequence as a graph and attention operations are performed between linked nodes. In contrast with previous self-attention models with pre-defined structures (edges), the model learns to construct attention edges to improve task-specific performances. In this way, the model is able to select the most salient nodes and reduce the quadratic complexity regardless of the sequence length.

accelerating and structuring self-attention, sac, sparse adaptive connection, (1 more...)

Neural Information Processing Systems

Oct-11-2024, 07:30:39 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.43)