Adversarial Intrinsic Motivation for Reinforcement Learning
–Neural Information Processing Systems
It further shows that the policy that minimizes this Wasserstein-1 distance is the policy that reaches the goal in as few steps as possible.
Neural Information Processing Systems
Aug-14-2025, 07:24:17 GMT
- Country:
- North America > United States
- California > Alameda County
- Berkeley (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Texas > Travis County
- Austin (0.15)
- California > Alameda County
- North America > United States
- Industry:
- Government (0.68)
- Technology: