Moral Mimicry: Large Language Models Produce Moral Rationalizations Tailored to Political Identity

Jun-17-2023–arXiv.org Artificial Intelligence

Large Language Models (LLMs) have demonstrated impressive capabilities in generating fluent text, but they also tend to reproduce undesirable social biases. This study investigates whether LLMs reproduce the moral biases associated with political groups in the United States, an instance of a broader capability herein termed moral mimicry. The hypothesis is examined in the GPT-3/3.5 and OPT families of Transformer-based LLMs. Using tools from Moral Foundations Theory, the study shows that these LLMs are indeed moral mimics: when prompted with a liberal or conservative political identity, the models generate text reflecting the corresponding moral biases. The study also explores the relationship between moral mimicry and model size, and the similarity between human and LLM moral word use.
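The measurement idea in the abstract (condition a prompt on a political identity, then look at moral word use in the generated text) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the prompt wording, the function names, and the tiny `TOY_LEXICON` are hypothetical and are not the paper's prompts or the actual Moral Foundations Dictionary. Only the five foundation names come from Moral Foundations Theory.

```python
import re
from collections import Counter

# Hypothetical toy lexicon for illustration only; NOT the real
# Moral Foundations Dictionary used in published research.
TOY_LEXICON = {
    "care": {"harm", "protect", "suffer", "compassion"},
    "fairness": {"fair", "equal", "justice", "rights"},
    "loyalty": {"loyal", "betray", "nation", "family"},
    "authority": {"obey", "law", "order", "tradition"},
    "sanctity": {"pure", "sacred", "disgust", "holy"},
}


def build_prompt(identity: str, scenario: str) -> str:
    """Build an identity-conditioned prompt (wording is an assumption)."""
    return f"As a {identity}, here is my moral argument about {scenario}:"


def foundation_frequencies(text: str, lexicon=TOY_LEXICON) -> dict:
    """Return, per foundation, the share of tokens found in its word list."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter()
    for tok in tokens:
        for foundation, words in lexicon.items():
            if tok in words:
                counts[foundation] += 1
    total = len(tokens)
    # Guard against empty text so the function never divides by zero.
    return {f: (counts[f] / total if total else 0.0) for f in lexicon}
```

In use, one would send `build_prompt("liberal", ...)` and `build_prompt("conservative", ...)` to a model, pass each completion to `foundation_frequencies`, and compare the per-foundation shares across the two identities. The model call itself is omitted because it depends on the API in use.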

large language model, machine learning, natural language

Country:
- Oceania > Australia (0.04)
- North America
  - Dominican Republic (0.04)
  - United States
    - Washington > King County
      - Seattle (0.04)
    - New York > New York County
      - New York City (0.04)
- Europe
  - Croatia (0.04)
  - Austria (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
- Asia > Middle East
  - Republic of Türkiye (0.04)
  - Israel (0.04)
  - UAE > Abu Dhabi Emirate
    - Abu Dhabi (0.14)

Genre:
- Research Report > New Finding (0.71)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.54)
