Scalable regret for learning to control network-coupled subsystems with unknown dynamics