Adaptive World Models: Learning Behaviors by Latent Imagination Under Non-Stationarity
Gospodinov, Emiliyan, Shaj, Vaisakh, Becker, Philipp, Geyer, Stefan, Neumann, Gerhard
–arXiv.org Artificial Intelligence
Developing foundational world models is a key research direction for embodied intelligence, with the ability to adapt to non-stationary environments being a crucial criterion. In this work, we introduce a new formalism, Hidden Parameter-POMDP, designed for control with adaptive world models. We demonstrate that this approach enables learning robust behaviors across a variety of non-stationary RL benchmarks. Additionally, this formalism effectively learns task abstractions in an unsupervised manner, resulting in structured, task-aware latent spaces.
arXiv.org Artificial Intelligence
Nov-2-2024
- Country:
- Europe > Germany > Baden-Württemberg
- Karlsruhe Region > Karlsruhe (0.04)
- Stuttgart Region > Stuttgart (0.04)
- Europe > Germany > Baden-Württemberg
- Genre:
- Research Report (1.00)
- Technology: