A Human Evaluation Details
Neural Information Processing Systems
A.1 Unlearning Toxicity Human Eval Details
We conduct human evaluation on 100 random prompts from the test set of R For each prompt, we compare 6 pairs of models: Quark versus other baselines, as shown in Table 2. In total we have 1200 comparisons, and each comparison is rated by 3 raters. We ran a qualification test to select qualified raters and ensure the quality and reliability of the evaluation process. Following the setting of [40], given a comparison of generations, the raters were asked three questions: 1. toxicity: which one is less rude, disrespectful or unreasonable?
A.2 Unlearning Sentiment Human Eval Details
Similar to above, we randomly choose 100 positive prompts and 100 negative prompts to conduct human evaluation.
Feb-10-2025, 03:19:24 GMT