Privacy Preserving Multi-Agent Reinforcement Learning in Supply Chains