Overcoming Overfitting in Reinforcement Learning via Gaussian Process Diffusion Policy