Regret Minimization via Saddle Point Optimization
–Neural Information Processing Systems
A long line of works characterizes the sample complexity of regret minimization in sequential decision-making by min-max programs. In the corresponding saddle-point game, the min-player optimizes the sampling distribution against an adversarial max-player that chooses confusing models leading to large regret.
Neural Information Processing Systems
Oct-8-2025, 21:12:47 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America
- Canada > Alberta (0.14)
- United States > California
- San Francisco County > San Francisco (0.14)
- Asia > Middle East
- Technology: