Scalable Reinforcement Learning of Localized Policies for Multi-Agent Networked Systems