Knowledge Retention for Continual Model-Based Reinforcement Learning
Sun, Yixiang, Fu, Haotian, Littman, Michael, Konidaris, George
arXiv.org Artificial Intelligence
We propose DRAGO, a novel approach for continual model-based reinforcement learning that improves the incremental development of world models across a sequence of tasks that differ in their reward functions but share the same state space and dynamics. DRAGO comprises two key components. Synthetic Experience Rehearsal leverages generative models to create synthetic experiences from past tasks, allowing the agent to reinforce previously learned dynamics without storing data. Regaining Memories Through Exploration introduces an intrinsic reward mechanism that guides the agent toward revisiting relevant states from prior tasks. Together, these components enable the agent to maintain a comprehensive and continually developing world model, facilitating more effective learning and adaptation across diverse environments. Empirical evaluations demonstrate that DRAGO preserves knowledge across tasks and achieves superior performance in various continual learning scenarios.
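As a rough illustration of the rehearsal idea described in the abstract, the sketch below mixes real transitions from the current task with synthetic transitions drawn from a generative model and labeled by a frozen copy of the earlier dynamics model. It is not the authors' implementation: the function names, the toy generator, the toy dynamics, and the simple concatenate-and-shuffle mixing rule are all assumptions made for this example, and the abstract does not specify any of them.

```python
import random

# Hypothetical sketch of synthetic experience rehearsal.
# All names and the mixing rule are assumptions, not taken from the paper.


def sample_synthetic_transitions(generator, old_dynamics, n, rng):
    """Draw (state, action) pairs from a generative model and label each
    with the next state predicted by the frozen earlier dynamics model."""
    batch = []
    for _ in range(n):
        state, action = generator(rng)
        next_state = old_dynamics(state, action)
        batch.append((state, action, next_state))
    return batch


def build_rehearsal_batch(real_transitions, generator, old_dynamics,
                          n_synthetic, rng):
    """Combine real transitions from the current task with synthetic ones
    standing in for past tasks, so no past data needs to be stored."""
    synthetic = sample_synthetic_transitions(
        generator, old_dynamics, n_synthetic, rng)
    mixed = list(real_transitions) + synthetic
    rng.shuffle(mixed)  # avoid ordering effects in the training batch
    return mixed


# Toy stand-ins (assumed): a 1-D state, two discrete actions.
def toy_generator(rng):
    # Pretend past tasks visited states near -2.0.
    return rng.gauss(-2.0, 0.5), rng.choice([-1.0, 1.0])


def toy_old_dynamics(state, action):
    # Pretend the earlier world model learned s' = s + 0.1 * a.
    return state + 0.1 * action


rng = random.Random(0)
real = [(3.0, 1.0, 3.1), (3.1, -1.0, 3.0)]  # current-task transitions
batch = build_rehearsal_batch(real, toy_generator, toy_old_dynamics,
                              n_synthetic=4, rng=rng)
```

The resulting `batch` would then be used to update the current world model, so that states the new task never visits are still represented in its training data.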
Mar-6-2025
- Country:
  - Asia > China
    - Shaanxi Province > Xi'an (0.04)
  - Europe
    - Austria > Vienna (0.14)
    - Netherlands > North Holland
      - Amsterdam (0.04)
  - North America
    - Canada
      - Alberta > Census Division No. 15
        - Improvement District No. 9 > Banff (0.04)
      - Quebec > Montreal (0.14)
    - United States
      - California > Los Angeles County
        - Long Beach (0.04)
      - Hawaii > Honolulu County
        - Honolulu (0.04)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - Maryland > Baltimore (0.04)
  - Oceania > Australia
    - New South Wales > Sydney (0.04)
- Genre:
  - Overview > Innovation (0.34)
  - Research Report > Promising Solution (0.34)
  - Workflow (1.00)
- Industry:
  - Health & Medicine (0.46)
- Technology: