Y our representations are in the network: composable and parallel adaptation for large scale models
–Neural Information Processing Systems
On the ViT -L/16 architecture, our experiments show that a single adapter, 1.3% of the full model, is able to reach full fine-tuning accuracy on average across 11 challenging downstream classification tasks. Compared with other forms of parameter-efficient adaptation, the isolated nature of the InCA adaptation is computationally desirable for large-scale models. For instance, we adapt ViT -G/14 (1.8B+ parameters) quickly with 20+ adapters in parallel on a single V100 GPU (76% GPU memory reduction) and exhaustively identify its
Neural Information Processing Systems
Feb-12-2026, 07:48:41 GMT
- Country:
- Asia
- Middle East
- South Korea > Seoul
- Seoul (0.04)
- Europe
- France (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Romania > Sud - Muntenia Development Region
- Giurgiu County > Giurgiu (0.04)
- North America
- Dominican Republic (0.04)
- United States
- Colorado > El Paso County
- Colorado Springs (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Colorado > El Paso County
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Asia
- Genre:
- Research Report (0.67)
- Technology: