A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis
Gur, Izzeddin, Furuta, Hiroki, Huang, Austin, Safdari, Mustafa, Matsuo, Yutaka, Eck, Douglas, Faust, Aleksandra
–arXiv.org Artificial Intelligence
Pre-trained large language models (LLMs) have recently achieved better generalization and sample efficiency in autonomous web automation. However, performance on real-world websites still suffers from (1) open domainness, (2) limited context length, and (3) a lack of inductive bias on HTML. We introduce WebAgent, an LLM-driven agent that learns from self-experience to complete tasks on real websites following natural language instructions. WebAgent plans ahead by decomposing instructions into canonical sub-instructions, summarizes long HTML documents into task-relevant snippets, and acts on websites via Python programs generated from those sub-instructions and snippets. We design WebAgent with two models: Flan-U-PaLM, for grounded code generation, and HTML-T5, for planning and summarization. HTML-T5 models are new pre-trained LLMs for long HTML documents that use local and global attention mechanisms and a mixture of long-span denoising objectives. We empirically demonstrate that our modular recipe improves the success on real websites by over 50%, and that HTML-T5 is the best model to solve various HTML understanding tasks, achieving an 18.7% higher success rate than the prior method on the MiniWoB web automation benchmark and SoTA performance on Mind2Web, an offline task planning evaluation.
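The modular recipe described in the abstract (plan a sub-instruction, summarize the HTML into a task-relevant snippet, generate a program, act) can be sketched as a simple closed loop. This is only an illustrative sketch under stated assumptions, not the authors' implementation: `Step`, `run_episode`, `demo`, and every callable passed in are hypothetical names, and the toy stand-ins in `demo` replace the real models (HTML-T5 for planning and summarization, Flan-U-PaLM for code generation) and the real browser.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Step:
    """One loop iteration: what was planned, what was read, what was run."""
    sub_instruction: str
    snippet: str
    program: str


def run_episode(
    instruction: str,
    get_html: Callable[[], str],
    plan: Callable[[str, List[Step], str], Optional[str]],
    summarize: Callable[[str, str], str],
    synthesize: Callable[[str, str], str],
    execute: Callable[[str], None],
    max_steps: int = 10,
) -> List[Step]:
    """Hypothetical plan -> summarize -> synthesize -> act loop.

    All callables are assumptions standing in for models and a browser.
    """
    history: List[Step] = []
    for _ in range(max_steps):
        html = get_html()                                # observe the current page
        sub_instruction = plan(instruction, history, html)
        if sub_instruction is None:                      # planner signals completion
            break
        snippet = summarize(sub_instruction, html)       # task-relevant HTML only
        program = synthesize(sub_instruction, snippet)   # program text to run
        execute(program)                                 # act on the website
        history.append(Step(sub_instruction, snippet, program))
    return history


def demo() -> List[Step]:
    """Run the loop with toy stand-ins instead of real models or a browser."""
    page = "<html><input id='q'><button id='go'>Search</button></html>"
    todo = ["type 'apartments' into the search box", "click the search button"]
    executed: List[str] = []  # records programs instead of running them

    def plan(instruction: str, history: List[Step], html: str) -> Optional[str]:
        # Toy planner: walk a fixed list of sub-instructions, then stop.
        return todo[len(history)] if len(history) < len(todo) else None

    def summarize(sub: str, html: str) -> str:
        # Toy summarizer: pick the element the sub-instruction refers to.
        return "<input id='q'>" if "type" in sub else "<button id='go'>Search</button>"

    def synthesize(sub: str, snippet: str) -> str:
        # Toy synthesizer: emit a comment-only "program" naming the target.
        return f"# {sub}\n# target: {snippet}"

    return run_episode("find apartments", lambda: page, plan, summarize,
                       synthesize, executed.append)
```

Passing the planner, summarizer, and synthesizer in as separate callables mirrors the modular split the abstract describes, so each component could be swapped independently in such a sketch.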
Oct-2-2023
- Country:
  - Asia
    - Japan > Honshū
      - Chūbu > Toyama Prefecture
        - Toyama (0.04)
      - Kantō > Tokyo Metropolis Prefecture
        - Tokyo (0.04)
    - Taiwan (0.04)
  - North America > United States
    - California
      - Sonoma County > Petaluma (0.04)
      - Los Angeles County
        - Compton (0.04)
        - Inglewood (0.04)
        - Los Angeles > Hollywood (0.04)
      - Butte County > Oroville (0.04)
      - Santa Clara County
        - Palo Alto (0.04)
        - Santa Clara (0.04)
      - Contra Costa County
        - Concord (0.04)
        - Martinez (0.04)
        - Walnut Creek (0.04)
      - San Francisco County > San Francisco (0.04)
      - Stanislaus County > Modesto (0.04)
      - San Diego County
      - San Mateo County > Redwood City (0.04)
      - Alameda County
      - San Bernardino County > Victorville (0.04)
      - Marin County > Novato (0.04)
    - New York > New York County
      - New York City (0.04)
    - Pennsylvania > Philadelphia County
      - Philadelphia (0.04)
  - South America > Chile
- Genre:
  - Research Report (0.63)
- Industry:
  - Banking & Finance > Real Estate (0.33)
  - Information Technology (0.46)
  - Leisure & Entertainment (0.46)
- Technology: