On Controllable Sparse Alternatives to Softmax

Anirban Laha, Saneem Ahmed Chemmengath, Priyanka Agrawal, Mitesh Khapra, Karthik Sankaranarayanan, Harish G. Ramaswamy

Neural Information Processing Systems 

Converting an n-dimensional vector to a probability distribution over n objects is a commonly used component in many machine learning tasks like multiclass classification, multilabel classification, attention mechanisms, etc.
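
As a rough illustration of such a conversion, the sketch below maps a score vector to a probability distribution in two ways: the standard softmax, and sparsemax, a previously published sparse alternative that projects the scores onto the probability simplex. The function names and the example vector are assumptions chosen for illustration; this is not the formulation proposed in the paper, whose details are not given in this excerpt.

```python
import math


def softmax(z):
    """Dense mapping: every output probability is strictly positive."""
    m = max(z)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


def sparsemax(z):
    """Euclidean projection of z onto the probability simplex.

    Unlike softmax, low-scoring entries can receive exactly zero probability.
    """
    u = sorted(z, reverse=True)
    cumsum = 0.0
    tau = 0.0
    for k, u_k in enumerate(u, start=1):
        cumsum += u_k
        # The support consists of the largest k entries satisfying this
        # condition; the last k that passes determines the threshold tau.
        if 1 + k * u_k > cumsum:
            tau = (cumsum - 1) / k
    return [max(v - tau, 0.0) for v in z]


scores = [1.0, 0.8, 0.1]  # hypothetical example scores
dense = softmax(scores)
sparse = sparsemax(scores)
```

For the example scores, softmax assigns nonzero probability to all three entries, while sparsemax yields approximately [0.6, 0.4, 0.0], setting the lowest-scoring entry to exactly zero.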
