Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning

Open in new window