Reviews: Recurrent World Models Facilitate Policy Evolution

Neural Information Processing Systems 

Summary: This paper proposes a new approach to building a world model for reinforcement learning. The focus is on encoding the visual world into a compressed representation, coupled with a world model that learns dynamics in that compressed space. The world model is a recurrent version of Bishop's mixture density network (Bishop, 1995, Neural Networks for Pattern Recognition, ch. 6). That network outputs the weights of a mixture of Gaussians (via a softmax), the means of the Gaussians (linear outputs), and the variances (modeled as e^z for a network output z, so each is a positive scale parameter). I had not seen a recurrent version of this network before.
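For reference, the mixture-density output head described above can be sketched as follows. This is a minimal NumPy sketch under my own assumptions; the weight-matrix names and shapes are illustrative, not taken from the paper:

```python
import numpy as np

def mdn_head(h, W_pi, W_mu, W_sigma):
    """Map a hidden state h to mixture-of-Gaussians parameters.

    Hypothetical weight matrices (illustrative, not from the paper):
      W_pi    -> K logits, softmaxed into mixture weights
      W_mu    -> K means (plain linear outputs)
      W_sigma -> K log-scales, exponentiated so each sigma > 0
    """
    logits = W_pi @ h
    pi = np.exp(logits - logits.max())   # numerically stable softmax
    pi = pi / pi.sum()                   # mixture weights sum to 1
    mu = W_mu @ h                        # means: linear outputs
    sigma = np.exp(W_sigma @ h)          # exp keeps scale parameters positive
    return pi, mu, sigma

# Tiny usage example with random weights
rng = np.random.default_rng(0)
h = rng.standard_normal(8)               # hidden state (e.g., from an RNN)
K = 5                                    # number of mixture components
pi, mu, sigma = mdn_head(h,
                         rng.standard_normal((K, 8)),
                         rng.standard_normal((K, 8)),
                         rng.standard_normal((K, 8)))
```

In the recurrent variant, h would be the RNN's hidden state at each time step, so the predicted mixture changes as the sequence unfolds.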