Efficient Last-Iterate Convergence in Regret Minimization via Adaptive Reward Transformation