Latent World Models For Intrinsically Motivated Exploration
–Neural Information Processing Systems
In this work we consider partially observable environments with sparse rewards. We present a self-supervised representation learning method for image-based observations, which arranges embeddings respecting temporal distance of observations. This representation is empirically robust to stochasticity and suitable for novelty detection from the error of a predictive forward model. We consider episodic and life-long uncertainties to guide the exploration. We propose to estimate the missing information about the environment with the world model, which operates in the learned latent space.
Neural Information Processing Systems
Oct-10-2024, 00:22:37 GMT
- Technology: