Smooth Q-learning: Accelerate Convergence of Q-learning Using Similarity