Interpreting and Analysing CLIP's Zero-Shot Image Classification via Mutual Knowledge

Neural Information Processing Systems 

We explore these aspects in relation to mutual knowledge and analyze CLIP's zero-shot predictions.