Q-Learning with Fine-Grained Gap-Dependent Regret