When Collaborative Filtering Meets Reinforcement Learning