Plancraft: an evaluation dataset for planning with LLM agents
