f-Policy Gradients: A General Framework for Goal Conditioned RL using f-Divergences

Neural Information Processing Systems 

We derive gradients for various f-divergences to optimize this objective.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found