PlanningwithGeneralObjectiveFunctions: GoingBeyondTotalRewards
–Neural Information Processing Systems
Note that inthis simple example, the state transition functionT and the reward functionr stillsatisfy theMarkovproperty.
Neural Information Processing Systems
Feb-9-2026, 16:55:25 GMT
- Country:
- Technology: