Online and Bandit Algorithms for Nonstationary Stochastic Saddle-Point Optimization

Roy, Abhishek, Chen, Yifang, Balasubramanian, Krishnakumar, Mohapatra, Prasant

Dec-3-2019–arXiv.org Machine Learning

Saddle-point optimization problems are an important class of optimization problems with applications to game theory, multi-agent reinforcement learning and machine learning. A majority of the rich literature available for saddle-point optimization has focused on the offline setting. In this paper, we study nonstationary versions of stochastic, smooth, strongly-convex and strongly-concave saddle-point optimization problem, in both online (or first-order) and multi-point bandit (or zeroth-order) settings. We first propose natural notions of regret for such nonstationary saddle-point optimization problems. We then analyze extragradient and Frank-Wolfe algorithms, for the unconstrained and constrained settings respectively, for the above class of nonstationary saddle-point optimization problems. We establish sub-linear regret bounds on the proposed notions of regret in both the online and bandit setting.

algorithm, assumption 1, optimization problem, (12 more...)

arXiv.org Machine Learning

Dec-3-2019

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Massachusetts (0.04)
  - California > Yolo County
    - Davis (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report (0.50)

Industry:
- Leisure & Entertainment (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found