Offline Multi-task Transfer RL with Representational Penalization