Appendix of Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning

Yao Mu, The University of Hong Kong

Aug-17-2025, 19:37:58 GMT–Neural Information Processing Systems

Ping Luo is the corresponding author.

With Equation 3 and Jensen's inequality applied in Equation 1, we have I(x, y) E … Therefore, if the number of confounders increases, the demand for data grows exponentially. When the data is not rich enough, the necessary condition may not be satisfied.

We provide the pseudo-code of DOMINO combined with model-based methods. First, the past state-action pairs are encoded into disentangled context vectors by the context encoder. Initialize batch B. for i = 1 to B do sample V …

Listing 1: PyTorch-style pseudo-code for dynamics change based on the MuJoCo engine.
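The exponential growth in data demand mentioned above can be illustrated with a small numeric sketch. It rests on a standard property of InfoNCE-style estimators: the estimate cannot exceed log K, where K is the number of samples contrasted, so capturing I nats of mutual information needs roughly K ≥ exp(I). The function names and the assumption that the total information is spread evenly across the context vectors are illustrative only and are not taken from the appendix.

```python
import math


def min_samples_for_infonce(mi_nats: float) -> int:
    """Smallest integer K with log(K) >= mi_nats.

    Assumes the InfoNCE estimate is capped at log(K), so K samples can
    certify at most log(K) nats of mutual information.
    """
    return math.ceil(math.exp(mi_nats))


def compare_sample_demand(total_mi_nats: float, num_contexts: int) -> tuple[int, int]:
    """Return (K for one entangled context, K per disentangled context).

    Hypothetical setting: the total mutual information is split evenly
    across `num_contexts` independent context vectors.
    """
    per_context_mi = total_mi_nats / num_contexts
    return (
        min_samples_for_infonce(total_mi_nats),
        min_samples_for_infonce(per_context_mi),
    )


if __name__ == "__main__":
    entangled, per_context = compare_sample_demand(total_mi_nats=10.0, num_contexts=5)
    print(entangled, per_context)  # 22027 8
```

Under these assumptions, a single context carrying 10 nats needs about 22,027 contrastive samples, whereas each of five contexts carrying 2 nats needs only 8, which is the intuition behind estimating several smaller mutual-information terms instead of one large one.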


