Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control
Piotr Miłoś
