Preference-grounded Token-level Guidance for Language Model Fine-tuning
