Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers
