RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision