Adaptive Policy Transfer in Reinforcement Learning