Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Jun-14-2026, 02:51:42 GMT–Neural Information Processing Systems

Reinforcement learning (RL) yields substantial improvements in large language models' (LLMs) downstream task performance and alignment with human values. Surprisingly, such large gains result from updating only a small subnetwork comprising just 5%-30% of the parameters, with the rest effectively unchanged. We refer to this phenomenon as parameter update sparsity induced by RL. It is observed across all 7 widely-used RL algorithms (e.g., PPO, GRPO, DPO) and all 10 LLMs from different families in our experiments. This sparsity is intrinsic and occurs without any explicit sparsity-promoting regularizations or architectural constraints.

large language model, machine learning, reinforcement learning, (10 more...)

Neural Information Processing Systems

Jun-14-2026, 02:51:42 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (0.85)
  - Machine Learning > Reinforcement Learning (0.65)