Quantifying and Enabling the Interpretability of CLIP-like Models

Open in new window