Is PRM Necessary? Problem-Solving RL Implicitly Induces PRM Capability in LLMs
