Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight