WebChoreArena: Evaluating Web Browsing Agents on Realistic Tedious Web Tasks
