GlitchBench: Can large multimodal models detect video game glitches?
