Response-Level Rewards Are All You Need for Online Reinforcement Learning in LLMs: A Mathematical Perspective