Frictional Q-Learning