WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection
He, Guanzhong, Yang, Zhen, Liu, Jinxin, Xu, Bin, Hou, Lei, Li, Juanzi
arXiv.org Artificial Intelligence
Search agents have achieved significant advancements in enabling intelligent information retrieval and decision-making within interactive environments. Although reinforcement learning has been employed to train agentic models capable of more dynamic interactive retrieval, existing methods are limited by shallow tool-use depth and the accumulation of errors over multiple iterative interactions. In this paper, we present WebSeer, a more intelligent search agent trained via reinforcement learning enhanced with a self-reflection mechanism. Specifically, we construct a large dataset annotated with reflection patterns and design a two-stage training framework that unifies cold start and reinforcement learning within the self-reflection paradigm for real-world web-based environments, which enables the model to generate longer and more reflective tool-use trajectories. Our approach substantially extends tool-use chains and improves answer accuracy. Using a single 14B model, we achieve state-of-the-art results on HotpotQA and SimpleQA, with accuracies of 72.3% and 90.0%, respectively, and demonstrate strong generalization to out-of-distribution datasets. The code is available at https://github.com/99hgz/WebSeer.
Oct-22-2025
- Country:
- Asia
- Middle East > Jordan (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Europe
- France (0.04)
- Germany > Saarland
- Saarbrücken (0.04)
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- North America
- Canada > Ontario
- Toronto (0.04)
- United States
- Tennessee > Shelby County
- Memphis (0.04)
- Michigan > Wayne County
- Detroit (0.04)
- Colorado (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Missouri > Jackson County
- Kansas City (0.04)
- Illinois > Cook County
- Chicago (0.05)
- New York (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- California > San Francisco County
- San Francisco (0.04)
- Minnesota (0.04)
- Genre:
- Research Report (0.83)
- Industry:
- Leisure & Entertainment > Sports > Baseball (1.00)