Interpreting ResNet-based CLIP via Neuron-Attention Decomposition

Open in new window