BEARCUBS: A benchmark for computer-using web agents

Open in new window