Sparsity Emerges Naturally in Neural Language Models

Jul-22-2019, arXiv.org Machine Learning

Concerns about interpretability, computational resources, and principled inductive priors have motivated efforts to engineer sparse neural models for NLP tasks. If sparsity is important for NLP, might well-trained neural models naturally become roughly sparse? Using the Taxi-Euclidean norm to measure sparsity, we find that frequent input words are associated with concentrated or sparse activations, while frequent target words are associated with dispersed activations but concentrated gradients. We find that gradients associated with function words are more concentrated than the gradients of content words, even controlling for word frequency.
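
The abstract names the Taxi-Euclidean norm but does not define it. As a rough illustration only, the sketch below assumes it refers to the ratio of the taxicab (L1) norm to the Euclidean (L2) norm, scaled by the square root of the vector length so that values fall between 1/sqrt(m) and 1. Under that assumption, lower values indicate a more concentrated (sparser) vector. The function name `taxi_euclidean` and the example vectors are hypothetical and are not taken from the paper.

```python
import math


def taxi_euclidean(v):
    """Assumed definition: ||v||_1 / (sqrt(m) * ||v||_2), where m = len(v).

    Equals 1/sqrt(m) when all mass sits in one coordinate and 1.0 when
    every coordinate has the same magnitude.
    """
    l1 = sum(abs(x) for x in v)
    l2 = math.sqrt(sum(x * x for x in v))
    if l2 == 0.0:
        raise ValueError("the ratio is undefined for the zero vector")
    return l1 / (math.sqrt(len(v)) * l2)


# Hypothetical activation vectors, for illustration only.
one_hot = [0.0, 0.0, 1.0, 0.0]   # fully concentrated
uniform = [0.5, 0.5, 0.5, 0.5]   # fully dispersed

print(taxi_euclidean(one_hot))   # 0.5, i.e. 1/sqrt(4)
print(taxi_euclidean(uniform))   # 1.0
```

The same function could be applied to a gradient vector instead of an activation vector, which is how a comparison such as "dispersed activations but concentrated gradients" might be quantified under this assumed definition.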

deep learning, neural network, sparsity

Genre:
- Research Report (0.64)

Industry:
- Transportation > Ground (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.98)
  - Natural Language (1.00)
