Incentivizing the Emergence of Grounded Discrete Communication Between General Agents
–arXiv.org Artificial Intelligence
We converted the recently developed BabyAI grid world platform to a sender/receiver setup in order to test the hypothesis that established deep reinforcement learning techniques are sufficient to incentivize the emergence of a grounded discrete communication protocol between general agents. This is in contrast to previous experiments that employed straight-through estimation or tailored inductive biases. Our results show that these can indeed be avoided, by instead providing proper environmental incentives. Moreover, they show that a longer interval between communications in-centivized more abstract semantics. In some cases, the communicating agents adapted to new environments more quickly than monolithic agents, showcasing the potential of emergent discrete communication for transfer learning.
arXiv.org Artificial Intelligence
Jan-6-2020
- Country:
- Europe
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Spain > Catalonia
- Europe
- Genre:
- Research Report > New Finding (0.68)
- Technology: