Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL Qin-Wen Luo

Oct-10-2025, 15:54:56 GMT–Neural Information Processing Systems

Offline reinforcement learning (RL) aims to learn a policy from a fixed dataset without additional interactions with the environment.

fine-tuning, offline policy, online fine-tuning, (13 more...)

Neural Information Processing Systems

Oct-10-2025, 15:54:56 GMT

Conferences PDF

Country:
- North America > United States
  - Montana (0.04)
  - Washington > King County
    - Seattle (0.04)
- Asia
  - Japan > Honshū
    - Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
  - China > Jiangsu Province
    - Nanjing (0.04)

Genre:
- Research Report > Experimental Study (0.93)

Industry:
- Information Technology (0.67)
- Education > Educational Setting
  - Online (0.45)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (0.92)
  - Machine Learning
    - Reinforcement Learning (0.68)
    - Neural Networks (0.67)

Duplicate Docs Excel Report

Title
OptimisticCriticReconstructionandConstrained Fine-TuningforGeneralOffline-to-OnlineRL

Similar Docs Excel Report more

Title	Similarity	Source
None found