TheValue-EquivalencePrinciple forModel-Based ReinforcementLearning SupplementaryMaterial