Linear Latent World Models in Simple Transformers: A Case Study on Othello-GPT

Hazineh, Dean S., Zhang, Zechen, Chiu, Jeffery

Oct-12-2023–arXiv.org Artificial Intelligence

Foundation models exhibit significant capabilities in decision-making and logical deductions. Nonetheless, a continuing discourse persists regarding their genuine understanding of the world as opposed to mere stochastic mimicry. This paper meticulously examines a simple transformer trained for Othello, extending prior research to enhance comprehension of the emergent world model of Othello-GPT. The investigation reveals that Othello-GPT encapsulates a linear representation of opposing pieces, a factor that causally steers its decision-making process. This paper further elucidates the interplay between the linear world representation and causal decision-making, and their dependence on layer depth and model complexity. We have made the code public.

linear latent world model, othello-gpt, simple transformer, (1 more...)

arXiv.org Artificial Intelligence

Oct-12-2023

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.40)

Industry:
- Media > Theater (1.00)
- Leisure & Entertainment (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (0.60)
  - Cognitive Science > Problem Solving (0.60)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found