SafeConstellations: Steering LLM Safety to Reduce Over-Refusals Through Task-Specific Trajectory

Open in new window