RecurFormer: Not All Transformer Heads Need Self-Attention

Open in new window