MetaGrad: Multiple Learning Rates in Online Learning

Feb-14-2020, 14:28:33 GMT–Neural Information Processing Systems

In online convex optimization it is well known that certain subclasses of objective functions are much easier than arbitrary convex functions. We are interested in designing adaptive methods that can automatically get fast rates in as many such subclasses as possible, without any manual tuning. Previous adaptive methods are able to interpolate between strongly convex and general convex functions. We present a new method, MetaGrad, that adapts to a much broader class of functions, including exp-concave and strongly convex functions, but also various types of stochastic and non-stochastic functions without any curvature. For instance, MetaGrad can achieve logarithmic regret on the unregularized hinge loss, even though it has no curvature, if the data come from a favourable probability distribution.

computer based training, educational technology, metagrad, (10 more...)

Neural Information Processing Systems

Feb-14-2020, 14:28:33 GMT

Conferences Web Page

Add feedback

Industry:
- Education > Educational Setting > Online (0.40)

Technology:
- Information Technology
  - Artificial Intelligence > Machine Learning (0.69)
  - Enterprise Applications > Human Resources
    - Learning Management (0.40)