A Policy Gradient Method for Confounded POMDPs