Real-time system optimal traffic routing under uncertainties -- Can physics models boost reinforcement learning?