Adaptive Surrogate Gradients for Sequential Reinforcement Learning in Spiking Neural Networks