Evidence of conceptual mastery in the application of rules by Large Language Models
Nunes, José Luiz; Almeida, Guilherme FCF; Flanagan, Brian
arXiv.org Artificial Intelligence
In this paper, we leverage psychological methods to investigate LLMs' conceptual mastery in applying rules. We introduce a novel procedure to match the diversity of thought generated by LLMs to that observed in a human sample. We then conduct two experiments comparing rule-based decision-making in humans and LLMs. Study 1 found that all investigated LLMs replicated human patterns, regardless of whether they were prompted with scenarios created before or after their training cut-off. Moreover, we found unanticipated differences between the two sets of scenarios among humans. Surprisingly, even these differences were replicated in LLM responses. Study 2 turned to a contextual feature of human rule application: under forced time delay, human samples rely more heavily on a rule's text than on other considerations such as a rule's purpose. Our results revealed that some models (Gemini Pro and Claude 3) responded in a human-like manner to a prompt describing either forced delay or time pressure, while others (GPT-4o and Llama 3.2 90b) did not. We argue that the evidence gathered suggests that LLMs have mastery over the concept of rule, with implications for both legal decision-making and philosophical inquiry.
Mar-2-2025
- Country:
  - Africa > Rwanda (0.04)
  - Asia > Thailand
  - Europe
    - France (0.04)
    - United Kingdom > England
      - Cambridgeshire > Cambridge (0.04)
      - Oxfordshire > Oxford (0.14)
  - North America > United States
    - California > San Francisco County
      - San Francisco (0.04)
    - Illinois > Cook County
      - Chicago (0.04)
    - Iowa (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
- Genre:
  - Research Report
    - New Finding (0.34)
    - Promising Solution (0.34)
- Industry:
  - Law (1.00)
- Technology: