Voting-Based Multi-Agent Reinforcement Learning