Evaluating Large Vision-and-Language Models on Children's Mathematical Olympiads

Neural Information Processing Systems 

Gemini, etc.; some of these breakthroughs even seem to enable AI models to outperform human abilities in varied tasks that demand higher-order cognitive skills.