Doubly Robust Augmented Transfer for Meta-Reinforcement Learning
Neural Information Processing Systems
In this paper, we propose a doubly robust augmented transfer (DRaT) approach that addresses the more general sparse-reward meta-RL scenario, in which both the dynamics and the reward functions vary across tasks.
Nov-20-2025, 00:47:40 GMT