Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Jun-22-2026, 13:48:53 GMT - Neural Information Processing Systems

Reinforcement learning (RL) yields substantial improvements in the downstream task performance of large language models (LLMs) and in their alignment with human values. Surprisingly, these large gains result from updating only a small subnetwork comprising just 5% to 30% of the parameters, with the rest effectively unchanged. We refer to this phenomenon as parameter update sparsity induced by RL. It is observed across all 7 widely used RL algorithms (e.g., PPO, GRPO, DPO) and all 10 LLMs from different families in our experiments. This sparsity occurs without any explicit sparsity-promoting regularization or architectural constraints.
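To make "parameter update sparsity" concrete, here is a minimal sketch of how the fraction of unchanged parameters between a base checkpoint and its RL-finetuned counterpart could be measured. It is an illustration, not the authors' code: the function name `update_sparsity`, the dict-of-lists parameter layout, the toy values, and the `tol` threshold for "effectively unchanged" are all assumptions introduced here.

```python
from typing import Dict, List


def update_sparsity(
    base: Dict[str, List[float]],
    tuned: Dict[str, List[float]],
    tol: float = 0.0,
) -> float:
    """Return the fraction of parameters whose value did not change.

    `base` and `tuned` map parameter names to flat lists of values
    (a hypothetical stand-in for real model checkpoints). `tol` is an
    assumed threshold: a parameter counts as unchanged when the absolute
    difference is at most `tol`.
    """
    unchanged = 0
    total = 0
    for name, base_vals in base.items():
        tuned_vals = tuned[name]
        if len(base_vals) != len(tuned_vals):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        for b, t in zip(base_vals, tuned_vals):
            total += 1
            if abs(t - b) <= tol:
                unchanged += 1
    if total == 0:
        raise ValueError("no parameters to compare")
    return unchanged / total


# Toy checkpoints (made-up values): 6 parameters, exactly 1 is updated.
base = {
    "layer0.weight": [0.10, -0.20, 0.30, 0.40],
    "layer0.bias": [0.0, 0.5],
}
tuned = {
    "layer0.weight": [0.10, -0.20, 0.35, 0.40],
    "layer0.bias": [0.0, 0.5],
}

sparsity = update_sparsity(base, tuned)
updated_fraction = 1.0 - sparsity  # size of the updated subnetwork
```

Under this reading, the 5% to 30% figure in the abstract corresponds to `updated_fraction`, and the update sparsity to its complement. With real checkpoints the same comparison would run over the tensors of each model's weights rather than over Python lists.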

large language model, machine learning, sparsity
