Reviews: Deep Structured Prediction with Nonlinear Output Transformations

Oct-7-2024, 22:25:28 GMT–Neural Information Processing Systems

This paper studies the problem of training deep structured models (models where the dependencies between the output variables are explicitly modelled and some components are modelled via neural networks). The key idea of this paper is to give up the standard modelling assumption of structured prediction: the score (or the energy) function is the sum of summands (potentials). Instead of using the sum the paper puts an arbitrary non-linear (a neural network) transformation on top of the potentials. The paper develops an inference (MAP prediction) technique for such models which is based on Lagrangian decomposition (often referred to as dual decomposition, see details below). The training of the model is done by combining this inference technique with the standard Structure SVM (SSVM) objective.

artificial intelligence, baseline, machine learning, (7 more...)

Neural Information Processing Systems

Oct-7-2024, 22:25:28 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report (0.37)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Supervised Learning (0.62)
  - Inductive Learning (0.62)
  - Neural Networks (0.48)