Fair Text Classification via Transferable Representations
Thibaud Leteno, Michael Perrot, Charlotte Laclau, Antoine Gourru, Christophe Gravier
Group fairness is a central research topic in text classification, where reaching fair treatment between sensitive groups (e.g., women and men) remains an open challenge. We propose an approach that extends the use of the Wasserstein Dependency Measure for learning unbiased neural text classifiers. Given the challenge of distinguishing fair from unfair information in a text encoder, we draw inspiration from adversarial training by inducing independence between representations learned for the target label and those for a sensitive attribute. We further show that Domain Adaptation can be efficiently leveraged to remove the need for access to the sensitive attributes in the dataset we cure. We provide both theoretical and empirical evidence that our approach is well-founded.
arXiv.org Artificial Intelligence
Mar-10-2025
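
The abstract describes a training scheme that penalizes statistical dependence between the learned text representation and a sensitive attribute. The sketch below illustrates that general idea only; it is not the paper's implementation. It stands in for the Wasserstein Dependency Measure with a simple sliced 1-D Wasserstein penalty between the representations of the two sensitive groups, and the encoder, toy embeddings, labels, hyper-parameters (e.g. lam), and the helper sliced_wasserstein are all illustrative assumptions.

# Minimal, self-contained sketch (not the authors' code): train a classifier
# while penalizing dependence between the representation z and a binary
# sensitive attribute s, approximated by a sliced 1-D Wasserstein distance
# between the two groups' representations.
import torch
import torch.nn as nn

torch.manual_seed(0)

def sliced_wasserstein(x, y, n_proj=32):
    # Average 1-D Wasserstein distances over random unit projections.
    d = x.size(1)
    proj = torch.randn(d, n_proj, device=x.device)
    proj = proj / proj.norm(dim=0, keepdim=True)
    x_p = torch.sort(x @ proj, dim=0).values
    y_p = torch.sort(y @ proj, dim=0).values
    n = min(x_p.size(0), y_p.size(0))  # crude size alignment for the sketch
    return (x_p[:n] - y_p[:n]).abs().mean()

class FairClassifier(nn.Module):
    def __init__(self, in_dim=64, hid=32, n_classes=2):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(in_dim, hid), nn.ReLU())
        self.head = nn.Linear(hid, n_classes)

    def forward(self, x):
        z = self.encoder(x)
        return self.head(z), z

# Toy stand-ins for pre-computed text embeddings X, labels y, sensitive attribute s.
X = torch.randn(256, 64)
y = torch.randint(0, 2, (256,))
s = torch.randint(0, 2, (256,))

model = FairClassifier()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
lam = 1.0  # assumed fairness/accuracy trade-off weight

for epoch in range(50):
    logits, z = model(X)
    task_loss = nn.functional.cross_entropy(logits, y)
    fair_loss = sliced_wasserstein(z[s == 0], z[s == 1])
    loss = task_loss + lam * fair_loss
    opt.zero_grad()
    loss.backward()
    opt.step()

In this sketch the weight lam controls how strongly group-dependent information is suppressed relative to task accuracy. The paper's full method further relies on adversarial-style training and on Domain Adaptation to handle datasets where sensitive attributes are unavailable, neither of which is shown here.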