An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning

Oct-10-2025, 22:29:30 GMT–Neural Information Processing Systems

In the standard reinforcement learning (RL) setting, the primary goal is to obtain a policy that maximizes a cumulative scalar reward [Sutton and Barto, 2018].

dataset, demonstration, target preference, (15 more...)

Neural Information Processing Systems

Oct-10-2025, 22:29:30 GMT

Conferences PDF

Country:
- South America > Chile
  - Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- Asia > China
  - Guangdong Province
    - Guangzhou (0.04)
    - Shenzhen (0.04)

Genre:
- Research Report > Experimental Study (0.93)

Industry:
- Information Technology (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
fdb11be1acf5e3724737dd585e590146-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found