On Centralized Critics in Multi-Agent Reinforcement Learning