The Power of Active Multi-Task Learning in Reinforcement Learning from Human Feedback

Open in new window