Follow the Perturbed Leader: Optimism and Fast Parallel Algorithms for Smooth Minimax Games

Dec-24-2025, 22:32:02 GMT–Neural Information Processing Systems

We consider the problem of online learning and its application to solving minimax games. For the online learning problem, Follow the Perturbed Leader (FTPL) is a widely studied algorithm which enjoys the optimal $O(T^{1/2})$ \emph{worst-case} regret guarantee for both convex and nonconvex losses. In this work, we show that when the sequence of loss functions is \emph{predictable}, a simple modification of FTPL which incorporates optimism can achieve better regret guarantees, while retaining the optimal worst-case regret guarantee for unpredictable sequences. A key challenge in obtaining these tighter regret bounds is the stochasticity and optimism in the algorithm, which require analysis techniques different from those commonly used in the analysis of FTPL. The key ingredient we utilize in our analysis is the dual view of perturbation as regularization.
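To make the idea of adding optimism to FTPL concrete, the following is a minimal sketch, not the paper's algorithm. It assumes the simplest setting: linear losses over a finite set of experts, an exponential perturbation with scale `eta`, and a guess for the next loss equal to the most recent loss vector. The names `optimistic_ftpl`, `regret`, `eta`, and `use_hint` are hypothetical and chosen only for illustration.

```python
import random


def optimistic_ftpl(losses, eta=1.0, use_hint=True, seed=0):
    """Sketch of FTPL with an optimistic guess, for the experts setting.

    losses: list of T loss vectors, each a list of d per-expert losses.
    Assumptions (not taken from the source): exponential perturbation
    with mean eta, and a guess equal to the previous round's loss.
    Returns the list of chosen expert indices, one per round.
    """
    rng = random.Random(seed)
    d = len(losses[0])
    cumulative = [0.0] * d   # sum of losses observed so far
    hint = [0.0] * d         # guess for the upcoming loss vector
    choices = []
    for loss in losses:
        # Fresh random perturbation each round; expovariate takes a rate.
        sigma = [rng.expovariate(1.0 / eta) for _ in range(d)]
        # Perturbed leader: minimize past losses plus the guess, minus noise.
        scores = [
            cumulative[i] + (hint[i] if use_hint else 0.0) - sigma[i]
            for i in range(d)
        ]
        choices.append(min(range(d), key=scores.__getitem__))
        # The true loss is revealed only after the choice is made.
        cumulative = [c + l for c, l in zip(cumulative, loss)]
        hint = list(loss)
    return choices


def regret(losses, choices):
    """Incurred loss minus the loss of the best fixed expert in hindsight."""
    incurred = sum(loss[i] for loss, i in zip(losses, choices))
    best_fixed = min(sum(column) for column in zip(*losses))
    return incurred - best_fixed
```

Setting `use_hint=False` recovers plain FTPL in this sketch, so the two variants can be compared on the same loss sequence. When consecutive losses are similar, the guess is accurate and the sequence is predictable in the sense described above.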

Industry:
- Education (0.82)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.76)