OpenReward: Learning to Reward Long-form Agentic Tasks via Reinforcement Learning

Open in new window