Offline Model-based Adaptable Policy Learning Xiong-Hui Chen 1, Y ang Y u

Aug-14-2025, 06:58:14 GMT–Neural Information Processing Systems

In reinforcement learning, a promising direction to avoid online trial-and-error costs is learning from an offline dataset. Current offline reinforcement learning methods commonly learn in the policy space constrained to in-support regions by the offline dataset, in order to ensure the robustness of the outcome policies.

dynamic model, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Aug-14-2025, 06:58:14 GMT

Conferences PDF

Add feedback

Country:
- Africa > Ethiopia
  - Addis Ababa > Addis Ababa (0.04)
- Asia
  - China > Jiangsu Province
    - Nanjing (0.04)
  - Myanmar > Tanintharyi Region
    - Dawei (0.04)
- Europe
  - Portugal (0.04)
  - Sweden (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
    - Greater London > London (0.04)
- North America
  - Canada > British Columbia
    - Vancouver (0.04)
  - United States
    - California > Los Angeles County
      - Long Beach (0.04)
    - Colorado > Denver County
      - Denver (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
- Oceania > Australia
  - Queensland > Brisbane (0.04)
- South America > Argentina
  - Pampas > Buenos Aires F.D. > Buenos Aires (0.04)

Genre:
- Research Report (0.68)

Industry:
- Information Technology (0.46)
- Marketing (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
470e7a4f017a5476afb7eeb3f8b96f9b-Paper.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found