Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration Lulu Zheng