BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning

Mar-22-2026, 13:58:38 GMT–Neural Information Processing Systems

Offline model-based reinforcement learning (MBRL) enhances data efficiency by utilizing pre-collected datasets to learn models and policies, especially in scenarios where exploration is costly or infeasible. Nevertheless, it often suffers from an objective mismatch between model and policy learning, so the learned policy can perform poorly even when the model's predictions are accurate. This paper first identifies the primary source of this mismatch as the underlying confounders present in offline data for MBRL.

artificial intelligence, machine learning, reinforcement learning

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.43)