Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation

Neural Information Processing Systems 

Training a policy in a source domain for deployment in the target domain under a dynamics shift can be challenging, often resulting in performance degradation.
