Just Say What You Want: Only-prompting Self-rewarding Online Preference Optimization

Open in new window