cf708fc1decf0337aded484f8f4519ae-Supplemental.pdf

Feb-11-2026, 06:17:17 GMT–Neural Information Processing Systems

We found that training an inverse model is crucial for learning good representations. On the first row,alevel from each environment that one-shot PPGS fails tosolve(thewhitearrowsrepresent thepolicy). Iterative Model Improvement In general settings, collecting training trajectories by sampling actions uniformly atrandom does not grant sufficient coverage ofthe state space. GLAMORGLAMOR [34] learns inverse dynamics to achieve visual goals in Atari games. The only difference withPPGS in terms of settings is that we allowGLAMORto collect data on-policy and for more interactions (2M).

artificial intelligence, trajectory, world model, (15 more...)

Neural Information Processing Systems

Feb-11-2026, 06:17:17 GMT

Conferences PDF

Add feedback

Industry:
- Leisure & Entertainment > Games > Computer Games (0.55)

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning (0.51)

Duplicate Docs Excel Report

Title
cf708fc1decf0337aded484f8f4519ae-Supplemental.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found