Quantity vs. Quality: On Hyperparameter Optimization for Deep Reinforcement Learning