When Is Partially Observable Reinforcement Learning Not Scary?