Trapezoidal Gradient Descent for Effective Reinforcement Learning in Spiking Networks