Reviews: Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation

Neural Information Processing Systems 

Originality: The proposed approach is a novel combination of well-known techniques such as RL and GAN for recommendation. Related work has been adequately cited. It is clear how the proposed approach differs from the existing literature. Quality: The approach appears to be technically sound. The theoretical analysis and the experiments support the claims.