LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMs

Open in new window