Exploiting Multiple Abstractions in Episodic RL via Reward Shaping