Simulation-Driven Reinforcement Learning in Queuing Network Routing Optimization