Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes
Christ, Bryan R., Gottesman, Zack, Kropko, Jonathan, Hartvigsen, Thomas
–arXiv.org Artificial Intelligence
Math reasoning is a highly active area of Large Language Model (LLM) research because it is a hallmark of artificial intelligence. However, few works have explored how math reasoning is encoded within LLM parameters and if it is a skill that can be isolated within a model. Doing so could allow targeted intervention to improve math performance without altering non-math behavior and foster understanding of how models encode math reasoning. We introduce Math Neurosurgery (MathNeuro), a method for isolating math-specific parameters in LLMs using only forward passes. MathNeuro builds on existing work by using weights and activations to calculate parameter importance, but isolates math-specific parameters by removing those important for general language tasks. Pruning parameters MathNeuro identifies deletes a LLM's math reasoning ability without destroying its general language ability. Scaling these parameters by a small constant improves a pretrained or instruction-tuned LLM's performance by 4-17% on GSM8K while leaving non-math behavior unaltered. MathNeuro is also data efficient: most of its effectiveness holds when identifying math-specific parameters using a single sample. MathNeuro highlights the potential for future work to intervene on math-specific parameters.
arXiv.org Artificial Intelligence
Oct-22-2024
- Country:
- Asia
- China
- Guangxi Province > Nanning (0.04)
- Hong Kong (0.04)
- Middle East
- Jordan (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Singapore (0.04)
- China
- Europe
- Denmark > Capital Region
- Copenhagen (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Denmark > Capital Region
- North America
- Mexico (0.04)
- United States > Virginia (0.04)
- Asia
- Genre:
- Research Report (0.50)
- Industry:
- Health & Medicine > Therapeutic Area
- Neurology (1.00)
- Psychiatry/Psychology (1.00)
- Health & Medicine > Therapeutic Area
- Technology: