On the definition of toxicity in NLP
Berezin, Sergey, Farahbakhsh, Reza, Crespi, Noel
–arXiv.org Artificial Intelligence
The fundamental problem in toxicity detection task lies in the fact that the toxicity is ill-defined. This causes us to rely on subjective and vague data in models' training, which results in non-robust and non-accurate results: garbage in - garbage out. This work suggests a new, stress-level-based definition of toxicity designed to be objective and context-aware. On par with it, we also describe possible ways of applying this new definition to dataset creation and model training.
arXiv.org Artificial Intelligence
Oct-19-2023
- Country:
- Europe
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Italy > Tuscany
- Florence (0.05)
- France > Provence-Alpes-Côte d'Azur
- Europe
- Genre:
- Research Report (0.40)
- Industry:
- Health & Medicine (0.95)
- Technology: