When Chain of Thought is Necessary, Language Models Struggle to Evade Monitors

Open in new window