Enhancing Preference-based Linear Bandits via Human Response Time

Open in new window