Elucidating the Design Space of Decay in Linear Attention
