Proper Value Equivalence

Neural Information Processing Systems 

Value equivalence (VE) distinguishes models based on a set of policies and a set of functions: a model is said to be VE to the environment if the Bellman operators it induces for the policies yield the correct result when applied to the functions.
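
The definition above can be made concrete with a minimal sketch for a finite (tabular) MDP. Everything here is an illustrative assumption rather than the paper's code: the array layout (`P` with shape `(A, S, S)`, `r` and `policy` with shape `(S, A)`), the helper names `bellman_operator` and `is_value_equivalent`, and the three-state toy environment. The check simply applies the policy's Bellman operator under the environment and under the model to each function and compares the results.

```python
import numpy as np

def bellman_operator(P, r, policy, v, gamma=0.9):
    """Apply the Bellman operator of `policy` to `v`: r_pi + gamma * P_pi v.

    Assumed shapes (hypothetical layout): P is (A, S, S), r is (S, A),
    policy is (S, A) with rows summing to 1, v is (S,).
    """
    r_pi = np.sum(policy * r, axis=1)            # expected reward per state
    P_pi = np.einsum("sa,ast->st", policy, P)    # state-to-state transitions under the policy
    return r_pi + gamma * P_pi @ v

def is_value_equivalent(env, model, policies, functions, gamma=0.9, tol=1e-8):
    """True if env and model Bellman operators agree on every (policy, function) pair."""
    P_env, r_env = env
    P_mod, r_mod = model
    return all(
        np.allclose(
            bellman_operator(P_env, r_env, pi, v, gamma),
            bellman_operator(P_mod, r_mod, pi, v, gamma),
            atol=tol,
        )
        for pi in policies
        for v in functions
    )

# Toy example (assumed): 3 states, 1 action, identical rewards.
r = np.array([[1.0], [0.0], [0.0]])
P_env = np.array([[[0.5, 0.5, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]])
P_mod = np.array([[[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]])
pi = np.ones((3, 1))                 # the only possible policy with one action

v_ok = np.array([1.0, 1.0, 0.0])     # cannot tell states 0 and 1 apart
v_bad = np.array([1.0, 0.0, 0.0])    # separates states 0 and 1

ve_small = is_value_equivalent((P_env, r), (P_mod, r), [pi], [v_ok])
ve_large = is_value_equivalent((P_env, r), (P_mod, r), [pi], [v_ok, v_bad])
```

In this toy case the model's transitions differ from the environment's, yet the model is VE with respect to `{v_ok}` because that function assigns the same value to the two states the model confuses. Adding `v_bad` to the set of functions breaks the equivalence, which illustrates how the choice of policies and functions determines which models count as VE.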
