Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs