Constrained GPI for Zero-Shot Transfer in Reinforcement Learning