STAIR: Improving Safety Alignment with Introspective Reasoning

Open in new window