Residual Off-Policy RL for Finetuning Behavior Cloning Policies
