Learning a subspace of policies for online adaptation in Reinforcement Learning