OntheConvergenceTheoryofDebiased Model-AgnosticMeta-ReinforcementLearning

Neural Information Processing Systems 

In particular, using stochastic gradients in MAML update steps is crucial for RL problems since computation of exact gradients requires access to a large number of possible trajectories.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found