
Neural Information Processing Systems

The Gumbel-Max trick is the basis of many relaxed gradient estimators. These estimators are easy to implement and low-variance, but the goal of scaling them comprehensively to large combinatorial distributions is still outstanding. Working within the perturbation model framework, we introduce stochastic softmax tricks, which generalize the Gumbel-Softmax trick to combinatorial spaces.
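As a point of reference for the tricks the abstract builds on, here is a minimal NumPy sketch of the standard Gumbel-Max and Gumbel-Softmax tricks for a single categorical variable (the paper's stochastic softmax tricks generalize this to structured combinatorial spaces, which this sketch does not cover; the function names and the temperature value are illustrative):

```python
import numpy as np

def sample_gumbel(shape, rng):
    # Gumbel(0, 1) samples via -log(-log(U)), U ~ Uniform(0, 1).
    u = rng.uniform(low=1e-12, high=1.0, size=shape)
    return -np.log(-np.log(u))

def gumbel_max(logits, rng):
    # Gumbel-Max trick: argmax of perturbed logits is an exact
    # sample from the categorical distribution softmax(logits).
    return int(np.argmax(logits + sample_gumbel(logits.shape, rng)))

def gumbel_softmax(logits, tau, rng):
    # Gumbel-Softmax relaxation: replace the non-differentiable
    # argmax with a temperature-controlled softmax. As tau -> 0 the
    # output approaches a one-hot sample.
    y = (logits + sample_gumbel(logits.shape, rng)) / tau
    y = y - y.max()          # numerical stability
    e = np.exp(y)
    return e / e.sum()

rng = np.random.default_rng(0)
logits = np.log(np.array([0.5, 0.3, 0.2]))
hard = gumbel_max(logits, rng)              # exact categorical sample
soft = gumbel_softmax(logits, 0.5, rng)     # relaxed sample on the simplex
```

The relaxed sample `soft` lies on the probability simplex and is differentiable with respect to `logits`, which is what makes relaxed gradient estimators straightforward to implement.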




Direct Policy Gradients: Direct Optimization of Policies in Discrete Action Spaces

Neural Information Processing Systems

Many problems in machine learning reduce to learning a probability distribution (or policy) over sequences of discrete actions so as to maximize a downstream utility function. Examples include generating text sequences to maximize a task-specific metric like BLEU and generating action sequences in reinforcement learning (RL) to maximize expected return.
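For context on the problem setup the abstract describes, here is a toy sketch of the classic score-function (REINFORCE) estimator for maximizing expected utility under a softmax policy over discrete actions. This is the standard baseline method, not the paper's direct-optimization approach; the utility function, step size, and sample count are illustrative assumptions:

```python
import numpy as np

def reinforce_step(logits, utility, rng, lr=0.1, n=512):
    # One gradient-ascent step on E_{a ~ softmax(logits)}[utility(a)]
    # using the score-function estimator with a mean baseline.
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    actions = rng.choice(len(logits), size=n, p=probs)
    rewards = np.array([utility(a) for a in actions])
    baseline = rewards.mean()               # simple variance reduction
    grad = np.zeros_like(logits)
    for a, r in zip(actions, rewards):
        # grad_logits log pi(a) = one_hot(a) - probs for a softmax policy.
        glogp = -probs
        glogp[a] += 1.0
        grad += (r - baseline) * glogp
    return logits + lr * grad / n

rng = np.random.default_rng(0)
utility = lambda a: [0.0, 1.0, 0.2][a]      # toy utility; action 1 is best
logits = np.zeros(3)
for _ in range(200):
    logits = reinforce_step(logits, utility, rng)
```

After training, the policy concentrates its mass on the highest-utility action; in sequence settings such as text generation for BLEU, the same estimator is applied over whole action sequences, where its variance becomes the central difficulty.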