Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences

Open in new window