BadThink: Triggered Overthinking Attacks on Chain-of-Thought Reasoning in Large Language Models

Open in new window