RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning

Dec-24-2025, 09:03:00 GMT–Neural Information Processing Systems

Offline reinforcement learning (RL) aims to find performant policies from logged data without further environment interaction. Model-based algorithms, which learn a model of the environment from the dataset and perform conservative policy optimisation within that model, have emerged as a promising approach to this problem.

adversarial model-based offline reinforcement learning, name change, rambo-rl, (4 more...)

Neural Information Processing Systems

Dec-24-2025, 09:03:00 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report > Promising Solution (0.40)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.31)