Decentralized Cooperative Multi-Agent Reinforcement Learning with Exploration