A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis

Gur, Izzeddin, Furuta, Hiroki, Huang, Austin, Safdari, Mustafa, Matsuo, Yutaka, Eck, Douglas, Faust, Aleksandra

Oct-2-2023–arXiv.org Artificial Intelligence

Pre-trained large language models (LLMs) have recently achieved better generalization and sample efficiency in autonomous web automation. However, the performance on real-world websites has still suffered from (1) open domainness, (2) limited context length, and (3) lack of inductive bias on HTML. We introduce WebAgent, an LLM-driven agent that learns from self-experience to complete tasks on real websites following natural language instructions. WebAgent plans ahead by decomposing instructions into canonical sub-instructions, summarizes long HTML documents into task-relevant snippets, and acts on websites via Python programs generated from those. We design WebAgent with Flan-U-PaLM, for grounded code generation, and HTML-T5, new pre-trained LLMs for long HTML documents using local and global attention mechanisms and a mixture of long-span denoising objectives, for planning and summarization. We empirically demonstrate that our modular recipe improves the success on real websites by over 50%, and that HTML-T5 is the best model to solve various HTML understanding tasks; achieving 18.7% higher success rate than the prior method on MiniWoB web automation benchmark, and SoTA performance on Mind2Web, an offline task planning evaluation.

arxiv preprint arxiv, language model, website, (12 more...)

arXiv.org Artificial Intelligence

Oct-2-2023

arXiv.org PDF

Add feedback

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- North America > United States
  - Pennsylvania > Philadelphia County
    - Philadelphia (0.04)
  - New York > New York County
    - New York City (0.04)
  - California
    - San Francisco County > San Francisco (0.04)
    - Butte County > Oroville (0.04)
    - Stanislaus County > Modesto (0.04)
    - Marin County > Novato (0.04)
    - San Bernardino County > Victorville (0.04)
    - San Mateo County > Redwood City (0.04)
    - Sonoma County > Petaluma (0.04)
    - Alameda County
      - Oakland (0.04)
      - Livermore (0.04)
    - San Diego County
      - San Diego (0.04)
      - Escondido (0.04)
    - Contra Costa County
      - Walnut Creek (0.04)
      - Martinez (0.04)
      - Concord (0.04)
    - Santa Clara County
      - Santa Clara (0.04)
      - Palo Alto (0.04)
    - Los Angeles County
      - Inglewood (0.04)
      - Los Angeles > Hollywood (0.04)
      - Compton (0.04)
- Asia
  - Taiwan (0.04)
  - Japan > Honshū
    - Kantō > Tokyo Metropolis Prefecture
      - Tokyo (0.04)
    - Chūbu > Toyama Prefecture
      - Toyama (0.04)

Genre:
- Research Report (0.63)

Industry:
- Leisure & Entertainment (0.46)
- Information Technology (0.46)
- Banking & Finance > Real Estate (0.33)

Technology:
- Information Technology
  - Communications > Web (1.00)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Natural Language > Large Language Model (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found