How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning