On the Empirical Complexity of Reasoning and Planning in LLMs

Kang, Liwei, Zhao, Zirui, Hsu, David, Lee, Wee Sun

Jun-17-2024–arXiv.org Artificial Intelligence

Chain-of-thought (CoT), tree-of-thought (ToT), and related techniques work surprisingly well in practice for some complex reasoning tasks with Large Language Models (LLMs), but why? This work seeks the underlying reasons by conducting experimental case studies and linking the performance benefits to well-established sample and computational complexity principles in machine learning. We experimented with 6 reasoning tasks, ranging from grade school math, air travel planning, ..., to Blocksworld. The results suggest that (i) both CoT and ToT benefit significantly from task decomposition, which breaks a complex reasoning task into a sequence of steps with low sample complexity and explicitly outlines the reasoning structure, and (ii) for computationally hard reasoning tasks, the more sophisticated tree structure of ToT outperforms the linear structure of CoT. These findings provide useful guidelines for the use of LLM in solving reasoning tasks in practice.

blue block, item, orange block, (16 more...)

arXiv.org Artificial Intelligence

Jun-17-2024

arXiv.org PDF

Add feedback

Country:
- South America
  - Brazil > São Paulo (0.04)
  - Argentina > Pampas
    - Buenos Aires F.D. > Buenos Aires (0.04)
- North America
  - United States
    - New York (0.05)
    - Nevada > Clark County
      - Las Vegas (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Illinois > Cook County
      - Chicago (0.04)
    - California
      - Los Angeles County > Los Angeles (0.04)
      - San Francisco County > San Francisco (0.04)
  - Mexico > Mexico City
    - Mexico City (0.04)
  - Honduras > Francisco Morazán
    - Tegucigalpa (0.05)
  - Guatemala > Guatemala
    - Guatemala City (0.05)
  - Canada > Ontario
    - Toronto (0.04)
- Europe
  - Italy (0.04)
  - Belgium (0.04)
  - Russia > Central Federal District
    - Moscow Oblast > Moscow (0.04)
  - Switzerland > Zürich
    - Zürich (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.04)
  - Czechia > Hradec Králové Region
    - Hradec Králové (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
  - United Kingdom > England
    - Hertfordshire (0.04)
    - Cambridgeshire > Cambridge (0.04)
  - Finland > Uusimaa
    - Helsinki (0.04)
  - Middle East > Republic of Türkiye
    - Istanbul Province > Istanbul (0.04)
- Asia
  - Singapore (0.04)
  - Thailand > Bangkok
    - Bangkok (0.04)
  - Taiwan > Taiwan Province
    - Taipei (0.04)
  - Pakistan > Punjab
    - Lahore Division > Lahore (0.04)
  - Middle East
    - Syria (0.04)
    - Iraq (0.04)
    - UAE
      - Dubai Emirate > Dubai (0.04)
      - Abu Dhabi Emirate > Abu Dhabi (0.04)
    - Republic of Türkiye > Istanbul Province
      - Istanbul (0.04)
  - Kazakhstan > Akmola Region
    - Astana (0.04)
  - India > Andhra Pradesh
    - Visakhapatnam (0.04)
  - China
    - Guangdong Province > Guangzhou (0.04)
    - Jiangsu Province > Nanjing (0.04)
    - Shanghai > Shanghai (0.04)
- Africa > Middle East
  - Libya (0.04)

Genre:
- Research Report > New Finding (0.34)

Industry:
- Consumer Products & Services > Travel (0.34)
- Transportation > Air (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.47)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found