Preference-grounded Token-level Guidance for Language Model Fine-tuning Shentao Yang

Neural Information Processing Systems 

Aligning language models (LMs) with preferences is an important problem in natural language generation.