Do Visual-Language Maps Capture Latent Semantics?
