Transferable Reinforcement Learning via Generalized Occupancy Models