Adaptive Data Exploitation in Deep Reinforcement Learning