Equipping Experts/Bandits with Long-term Memory

Kai Zheng, Haipeng Luo, Ilias Diakonikolas, Liwei Wang

Neural Information Processing Systems 

For both problems, the classical performance measure is the learner's (static) regret, defined as the difference between the learner's total loss and the loss of the best fixed action.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found