Optimistic ε-Greedy Exploration for Cooperative Multi-Agent Reinforcement Learning