Not All Images are Worth 16x16 Words: Dynamic Vision Transformers with Adaptive Sequence Length
