Design Considerations in Offline Preference-based RL
