Visual Sparse Steering: Improving Zero-shot Image Classification with Sparsity Guided Steering Vectors
Chatzoudis, Gerasimos, Li, Zhuowei, Moran, Gemma E., Wang, Hao, Metaxas, Dimitris N.
–arXiv.org Artificial Intelligence
Steering vision foundation models at inference time without retraining or access to large labeled datasets is a desirable yet challenging objective, particularly in dynamic or resource-constrained settings. In this paper, we introduce Visual Sparse Steering (VS2), a lightweight, test-time method that guides vision models using steering vectors derived from sparse features learned by top-$k$ Sparse Autoencoders without requiring contrastive data. Specifically, VS2 surpasses zero-shot CLIP by 4.12% on CIFAR-100, 1.08% on CUB-200, and 1.84% on Tiny-ImageNet. We further propose VS2++, a retrieval-augmented variant that selectively amplifies relevant sparse features using pseudo-labeled neighbors at inference time. With oracle positive/negative sets, VS2++ achieves absolute top-1 gains over CLIP zero-shot of up to 21.44% on CIFAR-100, 7.08% on CUB-200, and 20.47% on Tiny-ImageNet. Interestingly, VS2 and VS2++ raise per-class accuracy by up to 25% and 38%, respectively, showing that sparse steering benefits specific classes by disambiguating visually or taxonomically proximate categories rather than providing a uniform boost. Finally, to better align the sparse features learned through the SAE reconstruction task with those relevant for downstream performance, we propose Prototype-Aligned Sparse Steering (PASS). By incorporating a prototype-alignment loss during SAE training, using labels only during training while remaining fully test-time unsupervised, PASS consistently, though modestly, outperforms VS2, achieving a 6.12% gain over VS2 only on CIFAR-100 with ViT-B/32.
arXiv.org Artificial Intelligence
Jun-3-2025
- Country:
- Europe
- Latvia > Lubāna Municipality
- Lubāna (0.04)
- Switzerland > Zürich
- Zürich (0.14)
- Latvia > Lubāna Municipality
- North America > United States
- California (0.04)
- Europe
- Genre:
- Research Report (1.00)
- Technology: