Benchmarking Vision Language Models for Cultural Understanding
