Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution
–Neural Information Processing Systems
The ubiquitous and demonstrably suboptimal choice of resizing images to a fixed resolution before processing them with computer vision models has not yet been successfully challenged.
Neural Information Processing Systems
Apr-24-2026, 10:10:32 GMT