Linear Correlation in LM's Compositional Generalization and Hallucination

Peng, Letian, An, Chenyang, Hao, Shibo, Dong, Chengyu, Shang, Jingbo

Feb-6-2025–arXiv.org Artificial Intelligence

The generalization of language models (LMs) is undergoing active debates, contrasting their potential for general intelligence with their struggles with basic knowledge composition (e.g., reverse/transition curse). This paper uncovers the phenomenon of linear correlations in LMs during knowledge composition. For explanation, there exists a linear transformation between certain related knowledge that maps the next token prediction logits from one prompt to another, e.g., "X lives in the city of" $\rightarrow$ "X lives in the country of" for every given X. This mirrors the linearity in human knowledge composition, such as Paris $\rightarrow$ France. Our findings indicate that the linear transformation is resilient to large-scale fine-tuning, generalizing updated knowledge when aligned with real-world relationships, but causing hallucinations when it deviates. Empirical results suggest that linear correlation can serve as a potential identifier of LM's generalization. Finally, we show such linear correlations can be learned with a single feedforward network and pre-trained vocabulary representations, indicating LM generalization heavily relies on the latter.

correlation, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

Feb-6-2025

arXiv.org PDF

Add feedback

Country:
- South America
  - Uruguay (0.04)
  - Venezuela (0.04)
  - Peru (0.04)
  - Colombia > Meta Department
    - Villavicencio (0.04)
  - Chile > Santiago Metropolitan Region
    - Santiago Province > Santiago (0.04)
- Oceania
  - Samoa (0.04)
  - Fiji (0.04)
  - Australia > Australian Capital Territory
    - Canberra (0.04)
- North America
  - Nicaragua (0.04)
  - Honduras (0.04)
  - Guatemala (0.04)
  - Dominican Republic (0.04)
  - United States
    - Wisconsin > Milwaukee County
      - Milwaukee (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Indiana > Marion County
      - Indianapolis (0.04)
    - Illinois > Cook County
      - Chicago (0.06)
    - Hawaii > Honolulu County
      - Honolulu (0.04)
    - California > San Diego County
      - San Diego (0.04)
  - Cuba > La Habana Province
    - Havana (0.04)
  - Canada
    - Ontario > Toronto (0.04)
    - Manitoba > Winnipeg Metropolitan Region
      - Winnipeg (0.06)
- Europe
  - Austria > Vienna (0.14)
  - Kosovo (0.04)
  - Romania (0.04)
  - Iceland (0.04)
  - Switzerland (0.04)
  - Albania (0.04)
  - Slovenia (0.04)
  - Serbia (0.04)
  - North Macedonia (0.04)
  - Sweden > Stockholm
    - Stockholm (0.06)
  - Russia > Central Federal District
    - Moscow Oblast > Moscow (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.05)
  - Hungary > Budapest
    - Budapest (0.04)
  - Germany
    - Saxony > Dresden (0.04)
    - North Rhine-Westphalia > Upper Bavaria
      - Munich (0.04)
    - Baden-Württemberg > Stuttgart Region
      - Stuttgart (0.05)
  - Spain
    - Galicia > Madrid (0.04)
    - Valencian Community > Valencia Province
      - Valencia (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
  - Norway > Eastern Norway
    - Oslo (0.04)
  - Finland > Uusimaa
    - Helsinki (0.04)
  - Middle East > Republic of Türkiye
    - Istanbul Province > Istanbul (0.04)
  - France > Île-de-France
    - Paris > Paris (0.04)
  - Ireland > Leinster
    - County Dublin > Dublin (0.04)
  - United Kingdom
    - Scotland > City of Edinburgh
      - Edinburgh (0.04)
    - England > Leicestershire
      - Leicester (0.07)
  - Poland > Masovia Province
    - Warsaw (0.04)
- Asia
  - Russia (0.04)
  - Brunei (0.04)
  - Myanmar (0.04)
  - South Korea (0.04)
  - Laos (0.04)
  - Bangladesh (0.04)
  - Malaysia (0.04)
  - Mongolia (0.04)
  - Cambodia (0.04)
  - Indonesia > Java
    - Jakarta > Jakarta (0.04)
  - India
    - Maharashtra (0.06)
    - Rajasthan (0.04)
    - Karnataka > Bengaluru (0.04)
  - Thailand > Bangkok
    - Bangkok (0.05)
  - North Korea > Pyongyang
    - Pyongyang (0.04)
  - China
    - Shanghai > Shanghai (0.05)
    - Beijing > Beijing (0.04)
  - Middle East
    - Israel (0.04)
    - Jordan (0.04)
    - Oman (0.04)
    - Bahrain (0.04)
    - Saudi Arabia > Riyadh Province
      - Riyadh (0.04)
    - Syria > Aleppo Governorate
      - Aleppo (0.04)
    - Iraq
      - Nineveh Governorate > Mosul (0.04)
      - Baghdad Governorate > Baghdad (0.04)
    - UAE
      - Dubai Emirate > Dubai (0.04)
      - Abu Dhabi Emirate > Abu Dhabi (0.04)
    - Republic of Türkiye
      - Istanbul Province > Istanbul (0.04)
      - Ankara Province > Ankara (0.04)
    - Iran > Tehran Province
      - Tehran (0.04)
  - Afghanistan > Kabul Province
    - Kabul (0.04)
  - Singapore > Central Region
    - Singapore (0.04)
  - Japan > Honshū
    - Kantō > Tokyo Metropolis Prefecture
      - Tokyo (0.04)
    - Kansai > Kyoto Prefecture
      - Kyoto (0.04)
  - Pakistan
    - Islamabad Capital Territory > Islamabad (0.05)
    - Sindh > Karachi Division
      - Karachi (0.04)
- Africa
  - Sudan (0.04)
  - Rwanda (0.04)
  - Nigeria (0.04)
  - Liberia (0.04)
  - Ghana (0.04)
  - South Africa > Gauteng
    - Johannesburg (0.06)
  - Middle East
    - Tunisia (0.04)
    - Libya (0.04)
  - Kenya > Nairobi City County
    - Nairobi (0.05)

Genre:
- Research Report > New Finding (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Representation & Reasoning (0.93)
  - Machine Learning > Neural Networks
    - Deep Learning (0.70)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found