metric

Apr-25-2026, 23:44:35 GMT–Neural Information Processing Systems

Dynabench comprises four dynamic tasks with multiple rounds of datasets that will grow over time. Given that here we have to be able to evaluate a wide variety of models, both in the loop and outside of it, we employ a black box post hoc approach, i.e., one that can be applied post-data collection to existing data, on any uploaded model, without requiring anything other than its predictions. One straightforward way to measure fairness then, is to apply clearly delimited, heuristic perturbations to existing evaluation datasets, and measure whether performance drops. Such an approach is similar to recent works that use grammars to heuristically generate pairs of examples varying in gender [58] and/or race [67] in that they utilize predefined lists of words. However, because we also want to ensure minimal consequences on our classification labels, we adopted an approach that is more targeted than grammars and also preserves the original input data distribution: we replace each word in the input data that has a clear signal about race/ethnicity and/or gender identity with a similar word referring to another group, rerun inference, and measure how many labels flipped (i.e., the difference in microaverage accuracy).

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Apr-25-2026, 23:44:35 GMT

Conferences PDF

Add feedback

Industry:
- Transportation (0.37)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (0.98)
  - Natural Language (0.73)

Duplicate Docs Excel Report

Title
55b1927fdafef39c48e5b73b5d61ea60-Supplemental.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found