RLRF: Competitive Search Agent Design via Reinforcement Learning from Ranker Feedback
