A Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation

Xueying Bai, Jian Guan, Hongning Wang

Neural Information Processing Systems 

Reinforcement learning is well suited for optimizing policies of recommender systems.