TREND: Tri-teaching for Robust Preference-based Reinforcement Learning with Demonstrations

Open in new window