Regret Minimization via Saddle Point Optimization

Neural Information Processing Systems 

A long line of works characterizes the sample complexity of regret minimization in sequential decision-making by min-max programs. In the corresponding saddle-point game, the min-player optimizes the sampling distribution against an adversarial max-player that chooses confusing models leading to large regret.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found