Reviews: A Smoother Way to Train Structured Prediction Models

Oct-7-2024, 11:13:56 GMT–Neural Information Processing Systems

Overview: This paper proposes an accelerated variance-reduction algorithm for training structured predictors. In this approach the training objective is augmented with a proximal term anchored with a momentum point (eq (3)), the loss is smoothed using Nesterov's smoothing method (adding entropy or L2 to the dual), and a linear-rate solver (SVRG) is applied to the resulting objective in the inner loop. This achieves accelerated convergence rates for training. Comments: * I think that the connection to structured prediction is somewhat weak. In particular, the analysis uses the finite sum and smoothability of the training objective.

objective, oracle, train structured prediction model, (7 more...)

Neural Information Processing Systems

Oct-7-2024, 11:13:56 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Supervised Learning (0.66)
  - Inductive Learning (0.66)