Trust Region Policy Optimization of POMDPs