Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments

Linhardt, Lorenz, Neuhäuser, Tom, Tětková, Lenka, Eberle, Oliver

Apr-11-2025–arXiv.org Artificial Intelligence

Small and mid-sized generative language models have gained increasing attention. Their size and availability make them amenable to being analyzed at a behavioral as well as a representational level, allowing investigations of how these levels interact. We evaluate 32 publicly available language models for their representational and behavioral alignment with human similarity judgments on a word triplet task. This provides a novel evaluation setting to probe semantic associations in language beyond common pairwise comparisons. We find that (1) even the representations of small language models can achieve human-level alignment, (2) instruction-tuned model variants can exhibit substantially increased agreement, (3) the pattern of alignment across layers is highly model dependent, and (4) alignment based on models' behavioral responses is highly dependent on model size, matching their representational alignment only for the largest evaluated models.

large language model, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

Apr-11-2025

arXiv.org PDF

Add feedback

Country:
- Asia > Middle East
  - Jordan (0.04)
- Europe
  - Denmark > Capital Region
    - Kongens Lyngby (0.04)
  - Germany (0.04)
  - Italy > Calabria
    - Catanzaro Province > Catanzaro (0.04)
  - Middle East > Malta
    - Eastern Region > Northern Harbour District > St. Julian's (0.04)
- North America
  - Canada > Ontario
    - Toronto (0.04)
  - Mexico > Mexico City
    - Mexico City (0.04)
  - United States
    - Colorado > Boulder County
      - Boulder (0.04)
    - Florida > Miami-Dade County
      - Miami (0.04)

Genre:
- Research Report (0.83)

Industry:
- Health & Medicine > Therapeutic Area > Neurology (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Cognitive Science (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.47)
  - Natural Language
    - Large Language Model (0.96)
    - Text Processing (1.00)
  - Representation & Reasoning (1.00)