Risk-Aware Transfer in Reinforcement Learning using Successor Features
Supplementary Material

Neural Information Processing Systems 

Both the discounted and the total-reward episodic settings are amenable to function approximation.