Biases for Emergent Communication in Multi-agent Reinforcement Learning