RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs

Open in new window