Review for NeurIPS paper: Sample-Efficient Reinforcement Learning of Undercomplete POMDPs