Adaptive Hedge

Mar-15-2024, 02:59:11 GMT–Neural Information Processing Systems

Most methods for decision-theoretic online learning are based on the Hedge algorithm, which takes a parameter called the learning rate. In most previous analyses the learning rate was carefully tuned to obtain optimal worst-case performance, leading to suboptimal performance on easy instances, for example when there exists an action that is significantly better than all others. We propose a new way of setting the learning rate, which adapts to the difficulty of the learning problem: in the worst case our procedure still guarantees optimal performance, but on easy instances it achieves much smaller regret. In particular, our adaptive method achieves constant regret in a probabilistic setting, when there exists an action that on average obtains strictly smaller loss than all other actions. We also provide a simulation study comparing our approach to existing methods.

adahedge, algorithm, hedge, (17 more...)

Neural Information Processing Systems

Mar-15-2024, 02:59:11 GMT

Conferences PDF

Add feedback

Country:
- Europe
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.05)

Industry:
- Education > Educational Setting > Online (0.35)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (1.00)
  - Enterprise Applications > Human Resources
    - Learning Management (0.34)