A Human Evaluation Details A.1 Unlearning Toxicity Human Eval Details

Aug-17-2025, 19:39:47 GMT–Neural Information Processing Systems

In total we have 1200 comparisons, and each comparison is rated by 3 raters. In total we have 2400 comparisons, and each comparison is rated by 3 raters. These were: 1. Coherence: Is the system's generation aligned in meaning and topic with the prompt? We sampled 100 prompts randomly from the corpus, and then evaluated 19 different algorithms. HITs was 2.2K, and the total number of ratings was 6.6K.

artificial intelligence, lieutenant colonel, machine learning, (12 more...)

Neural Information Processing Systems

Aug-17-2025, 19:39:47 GMT

Conferences PDF

Add feedback

Country:
- Africa > North Africa (0.04)
- North America
  - United States (1.00)
  - Mexico (0.04)
- Asia
  - Middle East > Iraq (0.04)
  - Afghanistan (0.04)

Industry:
- Law (1.00)
- Government
  - Military > Army (0.47)
  - Regional Government > North America Government
    - United States Government (0.69)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.98)

Duplicate Docs Excel Report

Title
b125999bde7e80910cbdbd323087df8f-Supplemental-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found