Developing A Framework to Support Human Evaluation of Bias in Generated Free Response Text

Healey, Jennifer, Byrum, Laurie, Akhtar, Md Nadeem, Bhargava, Surabhi, Sinha, Moumita

May-7-2025–arXiv.org Artificial Intelligence

LLM evaluation is challenging even the case of base models. In real world deployments, evaluation is further complicated by th e interplay of task specific prompts and experiential context. A t scale, bias evaluation is often based on short context, fixed choicebench-marks that can be rapidly evaluated, however, these can lose validity when the LLMs' deployed context differs. Large scale h u-man evaluation is often seen as too intractable and costly. H ere we present our journey towards developing a semi-automatedbias evaluation framework for free text responses that has human insights at its core. We discuss how we developed an operational definition of bias that helped us automate our pipeline and a methodology for classifying bias beyond multiple choice. We additionally comment on how human evaluation helped us uncover problematic templates in a bias benchmark.

computational linguistic, large language model, natural language, (17 more...)

arXiv.org Artificial Intelligence

May-7-2025

arXiv.org PDF

Add feedback

Country:
- South America > Colombia
  - Meta Department > Villavicencio (0.04)
- North America
  - Dominican Republic (0.04)
  - United States
    - District of Columbia > Washington (0.05)
    - Washington > King County
      - Seattle (0.14)
    - New York > New York County
      - New York City (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Florida > Miami-Dade County
      - Miami (0.04)
    - California > Santa Clara County
      - San Jose (0.05)
  - Mexico > Mexico City
    - Mexico City (0.04)
- Europe
  - Switzerland (0.04)
  - Monaco (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Italy > Tuscany
    - Florence (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - Croatia > Dubrovnik-Neretva County
    - Dubrovnik (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report (0.50)
- Overview (0.46)

Technology:
- Information Technology
  - Human Computer Interaction (1.00)
  - Artificial Intelligence > Natural Language
    - Large Language Model (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found