A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis

Open in new window