User Tampering in Reinforcement Learning Recommender Systems