No LLM is Free From Bias: A Comprehensive Study of Bias Evaluation in Large Language Models

Kumar, Charaka Vinayak; Urlana, Ashok; Kanumolu, Gopichand; Garlapati, Bala Mallikarjunarao; Mishra, Pruthwik

Mar-14-2025–arXiv.org Artificial Intelligence

Advancements in Large Language Models (LLMs) have improved performance on a range of natural language understanding and generation tasks. Although LLMs have surpassed the previous state of the art on various tasks, they often reflect different forms of bias present in their training data. In light of this limitation, we provide a unified evaluation of benchmarks using a set of representative LLMs, covering forms of bias that range from physical characteristics to socio-economic categories. Moreover, we propose five prompting approaches to carry out bias detection across different aspects of bias. Further, we formulate three research questions to gain insight into detecting biases in LLMs using different approaches and evaluation metrics across benchmarks. The results indicate that each of the selected LLMs suffers from one form of bias or another, with the LLaMA3.1-8B model being the least biased. Finally, we conclude the paper by identifying key challenges and possible future directions.

large language model, machine learning, natural language

