SHADE-Arena: Evaluating Sabotage and Monitoring in LLM Agents