Unsupervised Cross-Domain Transfer in Policy Gradient Reinforcement Learning via Manifold Alignment