From Hard Refusals to Safe-Completions: Toward Output-Centric Safety Training

Open in new window