SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories

Open in new window