RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs