RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning

Aug-15-2025, 11:28:57 GMT–Neural Information Processing Systems

The model is trained to minimise the value function while still accurately predicting the transitions in the dataset, forcing the policy to act conservatively in areas not covered by the dataset. To approximately solve the two-player game, we alternate between optimising the policy and adversarially optimising the model.

dataset, international conference, reinforcement learning, (12 more...)

Neural Information Processing Systems

Aug-15-2025, 11:28:57 GMT

Conferences PDF

Add feedback

Country:
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre:
- Research Report
  - New Finding (0.67)
  - Promising Solution (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
6691c5e4a199b72dffd9c90acb63bcd6-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found