Reviews: Emergence of Language with Multi-agent Games: Learning to Communicate with Sequences of Symbols
–Neural Information Processing Systems
Increasing my score based on the authors rebuttal. The argument that the proposed method can complement human-bot training makes sense. Also, it seems RL baseline experiments were exhaustive. But the argument about the learnt language being compositional should be toned down since there is not enough evidence to support it. Old reviews: The paper proposes to use Gumbel-softmax for training sender and receiver agents in a referential game like Lazaridou (2016).
Neural Information Processing Systems
Oct-8-2024, 00:33:36 GMT
- Technology: