Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation

Open in new window