SHADE-Arena: Evaluating Sabotage and Monitoring in LLM Agents

Open in new window