Linear Correlation in LM's Compositional Generalization and Hallucination
Letian Peng, Chenyang An, Shibo Hao, Chengyu Dong, Jingbo Shang
arXiv.org Artificial Intelligence
The generalization of language models (LMs) is under active debate, contrasting their potential for general intelligence with their struggles on basic knowledge composition (e.g., the reverse/transition curse). This paper uncovers a phenomenon of linear correlations in LMs during knowledge composition. Specifically, between certain pairs of related knowledge there exists a linear transformation that maps the next-token prediction logits of one prompt to those of another, e.g., "X lives in the city of" $\rightarrow$ "X lives in the country of" for every given X. This mirrors the linearity in human knowledge composition, such as Paris $\rightarrow$ France. Our findings indicate that the linear transformation is resilient to large-scale fine-tuning: it generalizes updated knowledge when it aligns with real-world relationships, but causes hallucinations when it deviates from them. Empirical results suggest that linear correlation can serve as a potential identifier of an LM's generalization. Finally, we show that such linear correlations can be learned with a single feedforward network and pre-trained vocabulary representations, indicating that LM generalization relies heavily on the latter.
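The linear map between logits described in the abstract can be illustrated with a small sketch. This is not the authors' procedure: it assumes synthetic stand-in logits generated with NumPy instead of a real LM, an ordinary least-squares fit, and hypothetical names (`fit_linear_map`, `pearson`, `city_logits`, `country_logits`). It only shows what fitting a map of the form target = source · W + b and scoring it by Pearson correlation on held-out subjects could look like.

```python
import numpy as np

def fit_linear_map(src_logits, tgt_logits):
    """Least-squares fit of tgt_logits ≈ src_logits @ W + b (hypothetical helper)."""
    n = src_logits.shape[0]
    # Append a column of ones so the bias b is fitted jointly with W.
    X = np.hstack([src_logits, np.ones((n, 1))])
    coef, *_ = np.linalg.lstsq(X, tgt_logits, rcond=None)
    return coef[:-1], coef[-1]

def pearson(a, b):
    """Pearson correlation between two arrays, flattened."""
    a = a.ravel() - a.mean()
    b = b.ravel() - b.mean()
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Assumption: synthetic data standing in for next-token logits over a few
# city tokens ("X lives in the city of") and country tokens
# ("X lives in the country of"), one row per subject X.
rng = np.random.default_rng(0)
n_subjects, n_city, n_country = 200, 8, 3
true_W = rng.normal(size=(n_city, n_country))
true_b = rng.normal(size=n_country)
city_logits = rng.normal(size=(n_subjects, n_city))
noise = 0.05 * rng.normal(size=(n_subjects, n_country))
country_logits = city_logits @ true_W + true_b + noise

# Fit on some subjects, then check that the map transfers to unseen ones.
train, test = slice(0, 150), slice(150, None)
W, b = fit_linear_map(city_logits[train], country_logits[train])
pred = city_logits[test] @ W + b
r = pearson(pred, country_logits[test])
print(f"held-out Pearson correlation: {r:.3f}")
```

With real models, the two logit matrices would come from running the paired prompts over many subjects X; a high held-out correlation would then be the kind of signal the abstract calls linear correlation.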
Feb-6-2025
- Country:
- South America
- Uruguay (0.04)
- Venezuela (0.04)
- Peru (0.04)
- Colombia > Meta Department
- Villavicencio (0.04)
- Chile > Santiago Metropolitan Region
- Santiago Province > Santiago (0.04)
- Oceania
- Samoa (0.04)
- Fiji (0.04)
- Australia > Australian Capital Territory
- Canberra (0.04)
- North America
- Nicaragua (0.04)
- Honduras (0.04)
- Guatemala (0.04)
- Dominican Republic (0.04)
- United States
- Wisconsin > Milwaukee County
- Milwaukee (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Indiana > Marion County
- Indianapolis (0.04)
- Illinois > Cook County
- Chicago (0.06)
- Hawaii > Honolulu County
- Honolulu (0.04)
- California > San Diego County
- San Diego (0.04)
- Cuba > La Habana Province
- Havana (0.04)
- Canada
- Ontario > Toronto (0.04)
- Manitoba > Winnipeg Metropolitan Region
- Winnipeg (0.06)
- Europe
- Austria > Vienna (0.14)
- Kosovo (0.04)
- Romania (0.04)
- Iceland (0.04)
- Switzerland (0.04)
- Albania (0.04)
- Slovenia (0.04)
- Serbia (0.04)
- North Macedonia (0.04)
- Sweden > Stockholm
- Stockholm (0.06)
- Russia > Central Federal District
- Moscow Oblast > Moscow (0.04)
- Netherlands > North Holland
- Amsterdam (0.05)
- Hungary > Budapest
- Budapest (0.04)
- Germany
- Saxony > Dresden (0.04)
- Bavaria > Upper Bavaria
- Munich (0.04)
- Baden-Württemberg > Stuttgart Region
- Stuttgart (0.05)
- Spain
- Community of Madrid > Madrid (0.04)
- Valencian Community > Valencia Province
- Valencia (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Norway > Eastern Norway
- Oslo (0.04)
- Finland > Uusimaa
- Helsinki (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- France > Île-de-France
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- United Kingdom
- Scotland > City of Edinburgh
- Edinburgh (0.04)
- England > Leicestershire
- Leicester (0.07)
- Poland > Masovia Province
- Warsaw (0.04)
- Asia
- Russia (0.04)
- Brunei (0.04)
- Myanmar (0.04)
- South Korea (0.04)
- Laos (0.04)
- Bangladesh (0.04)
- Malaysia (0.04)
- Mongolia (0.04)
- Cambodia (0.04)
- Indonesia > Java
- India
- Maharashtra (0.06)
- Rajasthan (0.04)
- Karnataka > Bengaluru (0.04)
- Thailand > Bangkok
- Bangkok (0.05)
- North Korea > Pyongyang
- Pyongyang (0.04)
- China
- Middle East
- Israel (0.04)
- Jordan (0.04)
- Oman (0.04)
- Bahrain (0.04)
- Saudi Arabia > Riyadh Province
- Riyadh (0.04)
- Syria > Aleppo Governorate
- Aleppo (0.04)
- Iraq
- Nineveh Governorate > Mosul (0.04)
- Baghdad Governorate > Baghdad (0.04)
- UAE
- Dubai Emirate > Dubai (0.04)
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Ankara Province > Ankara (0.04)
- Iran > Tehran Province
- Tehran (0.04)
- Afghanistan > Kabul Province
- Kabul (0.04)
- Singapore > Central Region
- Singapore (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture
- Tokyo (0.04)
- Kansai > Kyoto Prefecture
- Kyoto (0.04)
- Pakistan
- Islamabad Capital Territory > Islamabad (0.05)
- Sindh > Karachi Division
- Karachi (0.04)
- Africa
- Sudan (0.04)
- Rwanda (0.04)
- Nigeria (0.04)
- Liberia (0.04)
- Ghana (0.04)
- South Africa > Gauteng
- Johannesburg (0.06)
- Middle East
- Kenya > Nairobi City County
- Nairobi (0.05)
- Genre:
- Research Report > New Finding (1.00)
- Technology: