Intra-Modal Proxy Learning for Zero-Shot Visual Categorization with CLIP

Feb-11-2026, 18:07:01 GMT–Neural Information Processing Systems

Vision-language pre-training methods, e.g., CLIP, demonstrate an impressive zero-shot performance on visual categorizations with the class proxy from the text embedding of the class name.

large language model, machine learning, proxy, (19 more...)

Neural Information Processing Systems

Feb-11-2026, 18:07:01 GMT

Conferences PDF

Country:
- North America > United States
  - Washington > Pierce County > Tacoma (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Asia > China
  - Zhejiang Province > Hangzhou (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks (0.68)

Duplicate Docs Excel Report

Title
Intra-Modal Proxy Learning for Zero-Shot Visual Categorization with CLIP

Similar Docs Excel Report more

Title	Similarity	Source
None found