Configurable Preference Tuning with Rubric-Guided Synthetic Data

Open in new window