MPO: Multilingual Safety Alignment via Reward Gap Optimization

Open in new window