GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts
