Optimizing Data Collection in Deep Reinforcement Learning