On First-Order Meta-Reinforcement Learning with Moreau Envelopes

Open in new window