End-to-End Agentic RAG System Training for Traceable Diagnostic Reasoning