Achieving Logarithmic Regret in KL-Regularized Zero-Sum Markov Games