Sample Complexity of Episodic Fixed-Horizon Reinforcement Learning