Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective