PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change

Neural Information Processing Systems 

These works have largely been suggesting that LLM's are indeed

Similar Docs  Excel Report  more

TitleSimilaritySource
None found