STAR: Efficient Preference-based Reinforcement Learning via Dual Regularization

Open in new window