Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL

Open in new window