Reward Function Optimization of a Deep Reinforcement Learning Collision Avoidance System