Policy Improvement for POMDPs Using Normalized Importance Sampling