A Review of Vision-Language Models and their Performance on the Hateful Memes Challenge

Open in new window