Reliable Weak-to-Strong Monitoring of LLM Agents

Open in new window