Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains