Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models
– Neural Information Processing Systems
We then comprehensively study detoxifying LMs with parameter sizes ranging from 126M up to 530B (3× larger than GPT-3), a scale that has never been studied before. We find that i) large LMs have similar toxicity levels as smaller ones given the same pre-training corpus, and ii) large LMs require more effort to unlearn the toxic content seen at pretraining. We also explore parameter-efficient training methods for detoxification.