Even Sparser Graph Transformers

Neural Information Processing Systems 

As the learned attention mechanisms tend to use few of these edges, such high-degree connections may be unnecessary.
