AITopics | decomposed mutual information optimization

DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning

Neural Information Processing SystemsDec-25-2025, 00:37:21 GMT

Adapting to the changes in transition dynamics is essential in robotic applications. By learning a conditional policy with a compact context, context-aware meta-reinforcement learning provides a flexible way to adjust behavior according to dynamics changes. However, in real-world applications, the agent may encounter complex dynamics changes. Multiple confounders can influence the transition dynamics, making it challenging to infer accurate context for decision-making. This paper addresses such a challenge by decomposed mutual information optimization (DOMINO) for context learning, which explicitly learns a disentangled context to maximize the mutual information between the context and historical trajectories while minimizing the state transition prediction error. Our theoretical analysis shows that DOMINO can overcome the underestimation of the mutual information caused by multi-confounded challenges via learning disentangled context and reduce the demand for the number of samples collected in various environments. Extensive experiments show that the context learned by DOMINO benefits both model-based and model-free reinforcement learning algorithms for dynamics generalization in terms of sample efficiency and performance in unseen environments.

decomposed mutual information optimization, domino, generalized context, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.87)

Add feedback

Appendix of Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning Y ao Mu The University of Hong Kong

Neural Information Processing SystemsAug-17-2025, 19:37:58 GMT

Ping Luo is the corresponding author. With Equation 3 and Jensen's inequality applied in Equation 1, we have I (x,y) E Therefore, if the number of confounders increases, then the demand for data will grow exponentially. When data is not rich enough, the nesseray condition may not be satisfied. We provide the pseudo-code of DOMINO combined with model-based methods. Firstly, the past state-action pairs are encoded into the disentangled context vectors by the context encoder. Initialize batch B . for i = 1 to B do sample V Listing 1: PyTorch-style pseudo-code for dynamics change based on Mujoco engine.

artificial intelligence, machine learning, visualization, (15 more...)

Neural Information Processing Systems

Country:

Asia > China > Hong Kong (0.40)
Asia > China > Tianjin Province > Tianjin (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Add feedback

DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning

Neural Information Processing SystemsJan-18-2025, 13:56:05 GMT

Adapting to the changes in transition dynamics is essential in robotic applications. By learning a conditional policy with a compact context, context-aware meta-reinforcement learning provides a flexible way to adjust behavior according to dynamics changes. However, in real-world applications, the agent may encounter complex dynamics changes. Multiple confounders can influence the transition dynamics, making it challenging to infer accurate context for decision-making. This paper addresses such a challenge by decomposed mutual information optimization (DOMINO) for context learning, which explicitly learns a disentangled context to maximize the mutual information between the context and historical trajectories while minimizing the state transition prediction error. Our theoretical analysis shows that DOMINO can overcome the underestimation of the mutual information caused by multi-confounded challenges via learning disentangled context and reduce the demand for the number of samples collected in various environments.

decomposed mutual information optimization, domino, meta-reinforcement learning, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.84)

Add feedback

Filters

Collaborating Authors

decomposed mutual information optimization

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning

Appendix of Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning Y ao Mu The University of Hong Kong

DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning