Large Language Models as Commonsense Knowledge for Large-Scale Task Planning

Mar-26-2025, 08:54:21 GMT–Neural Information Processing Systems

Large-scale task planning is a major challenge. Recent work uses large language models (LLMs) directly as a policy and reports surprisingly interesting results. This paper shows that LLMs provide a commonsense model of the world in addition to a policy that acts on it. The world model and the policy can be combined in a search algorithm, such as Monte Carlo Tree Search (MCTS), to scale up task planning. In our new LLM-MCTS algorithm, the LLM-induced world model provides a commonsense prior belief that lets MCTS reason effectively; the LLM-induced policy acts as a heuristic to guide the search, vastly improving search efficiency. Experiments show that LLM-MCTS outperforms both MCTS alone and policies induced by LLMs (GPT-2 and GPT-3.5) by a wide margin on complex, novel tasks. Further experiments and analyses on multiple tasks (multiplication, travel planning, object rearrangement) suggest minimum description length (MDL) as a general guiding principle: if the description length of the world model is substantially smaller than that of the policy, using an LLM as a world model for model-based planning is likely better than using an LLM solely as a policy.
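To make the combination of a prior belief and a policy heuristic concrete, here is a minimal sketch of the idea, not the authors' implementation. It assumes a hypothetical toy task (find an apple hidden in one of three containers), hard-coded tables standing in for the LLM-induced belief and policy, and a PUCT-style selection rule chosen for illustration; the abstract does not specify the exact rule. All names (`BELIEF`, `POLICY_PRIOR`, `plan`) are invented for this example.

```python
import math
import random

# Hypothetical toy task: an apple is hidden in one of three containers.
CONTAINERS = ["fridge", "cabinet", "drawer"]

# Stand-in for the LLM-induced world model: prior belief over the apple's location.
BELIEF = {"fridge": 0.7, "cabinet": 0.2, "drawer": 0.1}

# Stand-in for the LLM-induced policy: prior preference for opening each container.
POLICY_PRIOR = {"fridge": 0.6, "cabinet": 0.3, "drawer": 0.1}


class Node:
    def __init__(self):
        self.visits = 0
        self.children = {}  # action -> [visit count, total return, child Node]


def puct_select(node, actions, c=1.0):
    """Pick the action with the best mean return plus a policy-weighted bonus."""
    def score(action):
        n, total, _ = node.children.get(action, (0, 0.0, None))
        q = total / n if n else 0.0
        bonus = c * POLICY_PRIOR[action] * math.sqrt(node.visits + 1) / (1 + n)
        return q + bonus
    return max(actions, key=score)


def simulate(node, apple_at, remaining, gamma=0.9):
    """Run one simulation in a sampled world; return the discounted return."""
    if not remaining:
        return 0.0
    action = puct_select(node, remaining)
    entry = node.children.setdefault(action, [0, 0.0, Node()])
    if action == apple_at:
        ret = 1.0  # found the apple, episode ends
    else:
        rest = [a for a in remaining if a != action]
        ret = gamma * simulate(entry[2], apple_at, rest, gamma)
    # Back up statistics along the path taken.
    node.visits += 1
    entry[0] += 1
    entry[1] += ret
    return ret


def plan(num_simulations=500, seed=0):
    """Return the most-visited first action after searching from the root."""
    rng = random.Random(seed)
    root = Node()
    places = list(BELIEF)
    weights = [BELIEF[p] for p in places]
    for _ in range(num_simulations):
        # Sample a concrete world state from the prior belief.
        apple_at = rng.choices(places, weights=weights)[0]
        simulate(root, apple_at, CONTAINERS)
    return max(root.children, key=lambda a: root.children[a][0])


if __name__ == "__main__":
    print(plan())
```

In this sketch the belief decides which hidden states the search simulates, while the policy prior decides which branches get explored first; in the setting the abstract describes, both tables would instead be obtained by querying an LLM.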

large language model, machine learning, natural language

Country:
- Europe (0.92)

Genre:
- Research Report > New Finding (0.46)

Industry:
- Consumer Products & Services > Travel (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Computational Learning Theory > Minimum Complexity Machines (0.34)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.46)
    - Neural Networks > Deep Learning (0.52)
  - Natural Language > Large Language Model (1.00)
  - Representation & Reasoning (1.00)
