On Oracle-Efficient PAC RL with Rich Observations