Exploit Reward Shiftingin Value-Based Deep-RL: Optimistic Curiosity-Based Explorationand Conservative Exploitationvia Linear Reward Shaping