Process-Supervised Reinforcement Learning for Code Generation