SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment
