LookAhead Tuning: Safer Language Models via Partial Answer Previews
