Uncertainty-Based Smooth Policy Regularisation for Reinforcement Learning with Few Demonstrations

Open in new window