Variance-Reduced Gradient Estimation via Noise-Reuse in Online Evolution Strategies