A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs