Limits to Depth Efficiencies of Self-Attention

Open in new window