Enhancing Memory and Imagination Consistency in Diffusion-based World Models via Linear-Time Sequence Modeling
Lee, Jia-Hua, Lin, Bor-Jiun, Sun, Wei-Fang, Lee, Chun-Yi
arXiv.org Artificial Intelligence
World models are crucial for enabling agents to simulate and plan within environments, yet existing approaches struggle with long-term dependencies and inconsistent predictions. We introduce EDELINE, a novel framework that integrates diffusion models with linear-time state space models (SSMs) to enhance memory retention and temporal consistency. EDELINE employs a recurrent embedding module based on Mamba SSMs for processing unbounded sequences, a unified architecture for joint reward and termination prediction, and dynamic loss harmonization to balance multi-task learning. Our results across multiple benchmarks demonstrate EDELINE's superiority and robustness over prior baselines in long-horizon tasks.
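To make the "linear-time" and "unbounded sequences" claims concrete, here is a minimal sketch of a scalar linear state space recurrence. It is not EDELINE's implementation and not Mamba itself (Mamba uses input-dependent, selective parameters). The function name `ssm_scan` and the parameters `a`, `b`, `c`, and `h0` are assumptions chosen for illustration.

```python
from typing import List, Sequence, Tuple


def ssm_scan(
    xs: Sequence[float], a: float, b: float, c: float, h0: float = 0.0
) -> Tuple[List[float], float]:
    """Run h_t = a * h_{t-1} + b * x_t and y_t = c * h_t over a sequence.

    Hypothetical illustration: one pass over the input, so the cost grows
    linearly with sequence length, and the only memory carried between
    steps is the fixed-size state h.
    """
    h = h0
    ys: List[float] = []
    for x in xs:
        h = a * h + b * x  # update the fixed-size hidden state
        ys.append(c * h)   # read out an embedding of the history so far
    return ys, h


# Processing the whole sequence at once ...
full_ys, full_h = ssm_scan([1.0, 0.0, 0.0], a=0.5, b=1.0, c=2.0)

# ... matches processing it in chunks while carrying the state forward.
first_ys, carried_h = ssm_scan([1.0, 0.0], a=0.5, b=1.0, c=2.0)
rest_ys, chunked_h = ssm_scan([0.0], a=0.5, b=1.0, c=2.0, h0=carried_h)
```

Because the state has a fixed size, a sequence of any length can be consumed chunk by chunk without revisiting earlier inputs, which is the property the abstract relies on for memory over long horizons.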
Feb-1-2025
- Country:
  - North America > United States
    - California > Santa Clara County > Santa Clara (0.04)
  - Asia
    - Taiwan > Taiwan Province
      - Taipei (0.04)
    - Middle East > Saudi Arabia
      - Northern Borders Province > Arar (0.04)
- Genre:
  - Research Report > New Finding (0.66)
- Industry:
  - Health & Medicine > Consumer Health (0.71)
  - Leisure & Entertainment > Games
    - Computer Games (0.68)
- Technology: