Multimodal Reinforcement Learning with Agentic Verifier for AI Agents