Rankmax: An Adaptive Projection Alternative to the Softmax Function Supplementary Material

Neural Information Processing Systems

This document consists of results that support the material in the paper "Rankmax: An Adaptive Projection Alternative to the Softmax Function", hereafter referred to as the main paper.


A Regularized Framework for Sparse and Structured Neural Attention

Neural Information Processing Systems

Modern neural networks are often augmented with an attention mechanism, which tells the network where to focus within the input. We propose in this paper a new framework for sparse and structured attention, building upon a smoothed max operator. We show that the gradient of this operator defines a mapping from real values to probabilities, suitable as an attention mechanism. Our framework includes softmax and a slight generalization of the recently proposed sparsemax as special cases. We also show how our framework can incorporate modern structured penalties, resulting in more interpretable attention mechanisms that focus on entire segments or groups of an input. We derive efficient algorithms to compute the forward and backward passes of our attention mechanisms, enabling their use in a neural network trained with backpropagation. To showcase their potential as drop-in replacements for existing mechanisms, we evaluate them on three large-scale tasks: textual entailment, machine translation, and sentence summarization. Our attention mechanisms improve interpretability without sacrificing performance; notably, on textual entailment and summarization, we outperform the standard attention mechanisms based on softmax and sparsemax.
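To make the two special cases concrete, here is a minimal NumPy sketch, written for this summary rather than taken from the paper's code: softmax is the gradient of the log-sum-exp smoothed max, while sparsemax is the Euclidean projection onto the probability simplex, computed with the standard sorting-based threshold search of Martins and Astudillo (2016). The variable names are illustrative.

```python
import numpy as np

def softmax(z):
    """Dense probabilities: gradient of the log-sum-exp smoothed max."""
    e = np.exp(z - z.max())          # shift for numerical stability
    return e / e.sum()

def sparsemax(z):
    """Euclidean projection of z onto the probability simplex.

    Gradient of the squared-norm smoothed max; can return exact zeros
    (sorting-based threshold search of Martins & Astudillo, 2016).
    """
    z_sorted = np.sort(z)[::-1]                # scores in decreasing order
    cssv = np.cumsum(z_sorted) - 1.0           # cumulative sums, shifted by 1
    k = np.arange(1, z.size + 1)
    support = z_sorted - cssv / k > 0          # coordinates kept in the support
    tau = cssv[support][-1] / k[support][-1]   # threshold
    return np.maximum(z - tau, 0.0)

scores = np.array([2.0, 1.0, 0.1, -1.0])
print(softmax(scores))     # every entry strictly positive
print(sparsemax(scores))   # low-scoring entries exactly zero: [1., 0., 0., 0.]
```

On the sample scores, softmax assigns nonzero mass everywhere, while sparsemax zeroes out the low-scoring entries; that sparsity is what makes the resulting attention maps easier to interpret.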




Supplemental Material: A. Differential Negentropy and Boltzmann-Gibbs Distributions

Neural Information Processing Systems

"An important question is then whether in the modification the normalization should stand in front of the deformed exponential function, or whether it should be included as " Throughout our paper, we use the definition of [10, 25], equivalent to the maxent problem (27). Since each slice of a paraboloid is an ellipsoid, we can apply Cavalieri's principle to obtain the volume of a paraboloid N (t; 0, 1) = 1 2 null erf null v 2 null erf null u 2 nullnull v N (v; 0, 1) + uN ( u; 0, 1), (50) from which the expectation (49) can be computed directly. We start with the following lemma:Lemma 1. Applying Fubini's theorem, we fix The training and test sets are perfectly balanced: 12.5K negative and The documents have 280 words on average. Figure 4 illustrates the difficulties that continuous attention models may face when trying to focus on objects that are too far from each other or that seem to have different relative importance to answer the question. Batch size 64 Word embeddings size 300 Input image features size 2048 Input question features size 512 Fused multimodal features size 1024 Multi-head attention hidden size 512 Number of MCA layers 6 Number of attention heads 8 Dropout rate 0.1 MLP size in flatten layers 512 Optimizer Adam Base learning rate at epoch t starting from 1 min(2.