Multi-task Offline Reinforcement Learning for Online Advertising in Recommender Systems

Open in new window