Emergent Resource Exchange and Tolerated Theft Behavior using Multi-Agent Reinforcement Learning