Agentic RL Scaling Law: Spontaneous Code Execution for Mathematical Problem Solving
