Dynamic Grained Encoder for Vision Transformers Lin Song
–Neural Information Processing Systems
Transformers, the de-facto standard for language modeling, have been recently applied for vision tasks. This paper introduces sparse queries for vision transformers to exploit the intrinsic spatial redundancy of natural images and save computational costs.
Neural Information Processing Systems
Oct-3-2025, 04:33:54 GMT
- Country:
- Asia > China
- Guangxi Province > Nanning (0.05)
- Shaanxi Province > Xi'an (0.04)
- Shanghai > Shanghai (0.04)
- South America > Chile
- Asia > China
- Genre:
- Research Report > New Finding (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks (0.94)
- Natural Language (1.00)
- Vision (1.00)
- Information Technology > Artificial Intelligence