Efficient Preference-Based Reinforcement Learning: Randomized Exploration Meets Experimental Design
