Linear Latent World Models in Simple Transformers: A Case Study on Othello-GPT

Open in new window