Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems