Quantum Policy Gradient in Reproducing Kernel Hilbert Space