Make A Long Image Short: Adaptive Token Length for Vision Transformers
