A Smoother Way to Train Structured Prediction Models
Pillutla, Venkata Krishna, Roulet, Vincent, Kakade, Sham M., Harchaoui, Zaid
–Neural Information Processing Systems
We present a framework to train a structured prediction model by performing smoothing on the inference algorithm it builds upon. Smoothing overcomes the non-smoothness inherent to the maximum margin structured prediction objective, and paves the way for the use of fast primal gradient-based optimization algorithms. We illustrate the proposed framework by developing a novel primal incremental optimization algorithm for the structural support vector machine. The proposed algorithm blends an extrapolation scheme for acceleration and an adaptive smoothing scheme and builds upon the stochastic variance-reduced gradient algorithm. We establish its worst-case global complexity bound and study several practical variants.
Neural Information Processing Systems
Feb-14-2020, 15:27:14 GMT
- Technology: