Reviews: Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control

Neural Information Processing Systems 

The paper is well written and easy to read. I very much enjoyed reading the paper. If so, please make it explicit for better clarity. This could also motivate the variance-based control loss: when there is not much variance in the message, the sending agent does not have any preference over which action to choose, and hence its message can be safely ignored. I assume that you are using the same communication protocol even during training.
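
To illustrate the intuition about low-variance messages, here is a minimal sketch. It assumes each message is a vector of per-action scores and that a receiver drops any message whose variance falls below a fixed threshold. The names `message_variance`, `filter_messages`, and `threshold`, as well as the example values, are hypothetical and are not taken from the paper; this is not the authors' implementation.

```python
from statistics import pvariance


def message_variance(message):
    """Population variance of one message (assumed: a list of per-action scores)."""
    return pvariance(message)


def filter_messages(messages, threshold):
    """Keep only the messages whose variance exceeds the (hypothetical) threshold.

    A near-constant message expresses no preference among actions,
    so under this sketch it is treated as safe to ignore.
    """
    return {
        agent: msg
        for agent, msg in messages.items()
        if message_variance(msg) > threshold
    }


# Illustrative values only.
messages = {
    "agent_a": [0.5, 0.5, 0.5],  # no preference among actions
    "agent_b": [0.1, 0.9, 0.2],  # clear preference for the second action
}
kept = filter_messages(messages, threshold=0.05)
```

Under these assumptions, only the message from `agent_b` is kept, since the message from `agent_a` has zero variance.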