Interpreting Arithmetic Reasoning in Large Language Models using Game-Theoretic Interactions
