Multi-Reference Preference Optimization for Large Language Models
