Emergent bartering behaviour in multi-agent reinforcement learning