On the Accuracy of Self-Normalized Log-Linear Models

Andreas, Jacob, Rabinovich, Maxim, Jordan, Michael I., Klein, Dan

Dec-31-2015–Neural Information Processing Systems

Calculation of the log-normalizer is a major computational obstacle in applications of log-linear models with large output spaces. The problem of fast normalizer computation has therefore attracted significant attention in the theoretical and applied machine learning literature. In this paper, we analyze a recently proposed technique known as ``self-normalization'', which introduces a regularization term in training to penalize log normalizers for deviating from zero. This makes it possible to use unnormalized model scores as approximate probabilities. Empirical evidence suggests that self-normalization is extremely effective, but a theoretical understanding of why it should work, and how generally it can be applied, is largely lacking.We prove upper bounds on the loss in accuracy due to self-normalization, describe classes of input distributionsthat self-normalize easily, and construct explicit examples of high-variance input distributions. Our theoretical results make predictions about the difficulty of fitting self-normalized models to several classes of distributions, and we conclude with empirical validation of these predictions on both real and synthetic datasets.

likelihood gap, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Dec-31-2015

Conferences PDF

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Uncertainty (0.68)
  - Natural Language > Machine Translation (0.46)
  - Machine Learning
    - Neural Networks (0.69)
    - Statistical Learning (0.46)

Duplicate Docs Excel Report

Title
On the Accuracy of Self-Normalized Log-Linear Models ∗, Maxim Rabinovich
On the Accuracy of Self-Normalized Log-Linear Models

Similar Docs Excel Report more

Title	Similarity	Source
None found