Adversarial Intrinsic Motivation for Reinforcement Learning
–Neural Information Processing Systems
In this paper, we investigate whether one such objective, the Wasserstein-1 distance between a policy's state visitation distribution and a target distribution, can be utilized effectively for reinforcement learning (RL) tasks.
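To make the quantity in the abstract concrete, here is a minimal sketch of the Wasserstein-1 distance between two empirical distributions. It assumes one-dimensional states and equal-size samples, where the distance reduces to the mean absolute difference between sorted samples. The function name `empirical_w1` and the sample values are hypothetical, and this is not the estimator used in the paper.

```python
def empirical_w1(xs, ys):
    """Wasserstein-1 distance between two equal-size 1-D empirical samples."""
    if len(xs) != len(ys) or not xs:
        raise ValueError("samples must be non-empty and the same size")
    # In one dimension with uniform weights, an optimal transport plan pairs
    # the i-th smallest point of one sample with the i-th smallest of the other.
    paired = zip(sorted(xs), sorted(ys))
    return sum(abs(a - b) for a, b in paired) / len(xs)


# Hypothetical data: states visited by a policy vs. a target that puts
# all of its mass on a single goal state at 3.0.
visited = [0.0, 1.0, 2.0, 3.0]
target = [3.0, 3.0, 3.0, 3.0]
distance = empirical_w1(visited, target)
```

The distance shrinks toward zero as the visited states concentrate on the goal, which is the sense in which such a distance could serve as a learning objective.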
Feb-8-2026, 11:56:06 GMT
- Country:
- North America > United States
- California > Alameda County
- Berkeley (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Texas > Travis County
- Austin (0.05)
- Industry:
- Government (0.47)
- Technology: