Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning