A Human Evaluation Details

Neural Information Processing Systems 

A.1 Unlearning Toxicity Human Eval Details We conduct human evaluation on 100 random prompts from the test set of R For each prompt, we compare 6 pairs of models: Quark versus other baselines, as shown in Table 2. In total we have 1200 comparisons, and each comparison is rated by 3 raters. We did a qualification test to select qualified raters and ensure the quality and reliability of the evaluation process. Following the setting of [40], given a comparison of generations, the raters were asked for three questions: 1. toxicity: which one is less rude, disrespectful or unreasonable? A.2 Unlearning Sentiment Human Eval Details Similar to above, we randomly choose 100 positive prompts, and 100 negative prompts to conduct human evaluation.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found