Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning

Jan-19-2025, 10:26:17 GMT–Neural Information Processing Systems

Unsupervised pre-training methods utilizing large and diverse datasets have achieved tremendous success across a range of domains. Recent work has investigated such unsupervised pre-training methods for model-based reinforcement learning (MBRL) but is limited to domain-specific or simulated data. In this paper, we study the problem of pre-training world models with abundant in-the-wild videos for efficient learning of downstream visual control tasks. However, in-the-wild videos are complicated with various contextual factors, such as intricate backgrounds and textured appearance, which precludes a world model from extracting shared world knowledge to generalize better. To tackle this issue, we introduce Contextualized World Models (ContextWM) that explicitly separate context and dynamics modeling to overcome the complexity and diversity of in-the-wild videos and facilitate knowledge transfer between distinct scenes.

contextualized world model, pre-training contextualized world model, reinforcement learning, (5 more...)

Neural Information Processing Systems

Jan-19-2025, 10:26:17 GMT

Conferences Web Page

Add feedback

Industry:
- Education > Educational Technology > Educational Software > Computer Based Training (0.40)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Cognitive Science > Problem Solving (1.00)