RELiQ: Scalable Entanglement Routing via Reinforcement Learning in Quantum Networks