Evidence of conceptual mastery in the application of rules by Large Language Models

Nunes, José Luiz, Almeida, Guilherme FCF, Flanagan, Brian

Mar-2-2025–arXiv.org Artificial Intelligence

In this paper we leverage psychological methods to investigate LLMs' conceptual mastery in applying rules. We introduce a novel procedure to match the diversity of thought generated by LLMs to that observed in a human sample. We then conducted two experiments comparing rule-based decision-making in humans and LLMs. Study 1 found that all investigated LLMs replicated human patterns regardless of whether they are prompted with scenarios created before or after their training cut-off. Moreover, we found unanticipated differences between the two sets of scenarios among humans. Surprisingly, even these differences were replicated in LLM responses. Study 2 turned to a contextual feature of human rule application: under forced time delay, human samples rely more heavily on a rule's text than on other considerations such as a rule's purpose.. Our results revealed that some models (Gemini Pro and Claude 3) responded in a human-like manner to a prompt describing either forced delay or time pressure, while others (GPT-4o and Llama 3.2 90b) did not. We argue that the evidence gathered suggests that LLMs have mastery over the concept of rule, with implications for both legal decision making and philosophical inquiry.

llm, participant, rule violation judgment, (13 more...)

arXiv.org Artificial Intelligence

Mar-2-2025

arXiv.org PDF

Add feedback

Country:
- Africa > Rwanda (0.04)
- North America > United States
  - Iowa (0.04)
  - Massachusetts > Middlesex County
    - Cambridge (0.04)
  - Illinois > Cook County
    - Chicago (0.04)
  - California > San Francisco County
    - San Francisco (0.04)
- Europe
  - France (0.04)
  - United Kingdom > England
    - Oxfordshire > Oxford (0.14)
    - Cambridgeshire > Cambridge (0.04)
- Asia > Thailand
  - Bangkok > Bangkok (0.04)

Genre:
- Research Report
  - New Finding (0.34)
  - Promising Solution (0.34)

Industry:
- Law (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found