REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real Websites
