A Human Evaluation Details A.1 Unlearning Toxicity Human Eval Details
–Neural Information Processing Systems
In total we have 1200 comparisons, and each comparison is rated by 3 raters. In total we have 2400 comparisons, and each comparison is rated by 3 raters. These were: 1. Coherence: Is the system's generation aligned in meaning and topic with the prompt? We sampled 100 prompts randomly from the corpus, and then evaluated 19 different algorithms. HITs was 2.2K, and the total number of ratings was 6.6K.
Neural Information Processing Systems
Aug-17-2025, 19:39:47 GMT
- Country:
- Africa > North Africa (0.04)
- Asia
- Afghanistan (0.04)
- Middle East > Iraq (0.04)
- North America
- Mexico (0.04)
- United States (1.00)
- Industry:
- Government
- Military > Army (0.47)
- Regional Government > North America Government
- United States Government (0.69)
- Law (1.00)
- Government
- Technology: