Efficient Last-Iterate Convergence in Regret Minimization via Adaptive Reward Transformation

Open in new window