PAC Reinforcement Learning with Rich Observations