Preference Tuning For Toxicity Mitigation Generalizes Across Languages
