An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
Neural Information Processing Systems
In the standard reinforcement learning (RL) setting, the primary goal is to obtain a policy that maximizes a cumulative scalar reward [Sutton and Barto, 2018].
Oct-10-2025, 22:29:30 GMT
- Country:
  - Asia > China > Guangdong Province (0.28)
- Genre:
  - Research Report > Experimental Study (0.93)
- Industry:
  - Information Technology (1.00)
- Technology: