Sample-Efficient Reinforcement Learning of Undercomplete POMDPs