Representation Learning for Out-Of-Distribution Generalization in Reinforcement Learning