SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
