Equipping Experts/Bandits with Long-term Memory
Kai Zheng, Haipeng Luo, Ilias Diakonikolas, Liwei Wang
–Neural Information Processing Systems
For both problems, the classical performance measure is the learner's (static) regret, defined as the difference between the learner's total loss and the loss of the best fixed action.
Neural Information Processing Systems
Oct-2-2025, 13:27:57 GMT
- Country:
- Asia > China (0.04)
- North America
- Canada (0.04)
- United States
- California > Alameda County
- Berkeley (0.04)
- Wisconsin > Dane County
- Madison (0.04)
- California > Alameda County
- Industry:
- Education (0.68)
- Technology: