Self-Boosting Large Language Models with Synthetic Preference Data

Open in new window