Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces

Oct-10-2024, 14:36:52 GMT – Neural Information Processing Systems

Structured latent variables make it possible to incorporate meaningful prior knowledge into deep learning models. However, learning with such variables remains challenging because they are discrete. The standard learning approach today is to define a latent variable as the output of a perturbed algorithm and to train with a differentiable surrogate. In general, the surrogate places additional constraints on the model and inevitably leads to biased gradients. To alleviate these shortcomings, we extend the Gumbel-Max trick to define distributions over structured domains.
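For readers unfamiliar with the Gumbel-Max trick that the abstract builds on, the following is a minimal sketch of its standard, unstructured form: adding independent Gumbel(0, 1) noise to the logits and taking the argmax gives an exact sample from the corresponding categorical distribution. This is not the recursive extension to structured domains proposed in the paper; the function names (`gumbel_max_sample`, `empirical_frequencies`, `softmax`) and the example probabilities are illustrative assumptions.

```python
import numpy as np


def gumbel_max_sample(logits, rng):
    """Draw one categorical sample as the argmax of Gumbel-perturbed logits."""
    # Independent Gumbel(0, 1) noise, one draw per category.
    gumbel_noise = rng.gumbel(loc=0.0, scale=1.0, size=logits.shape)
    return int(np.argmax(logits + gumbel_noise))


def empirical_frequencies(logits, num_samples, seed=0):
    """Estimate category frequencies from repeated Gumbel-Max samples."""
    rng = np.random.default_rng(seed)
    counts = np.zeros(logits.shape[0])
    for _ in range(num_samples):
        counts[gumbel_max_sample(logits, rng)] += 1
    return counts / num_samples


def softmax(logits):
    """Reference categorical probabilities implied by the logits."""
    shifted = logits - np.max(logits)  # subtract the max for numerical stability
    exp = np.exp(shifted)
    return exp / exp.sum()


if __name__ == "__main__":
    # Hypothetical three-category distribution, chosen only for illustration.
    logits = np.log(np.array([0.5, 0.3, 0.2]))
    print("target probabilities: ", softmax(logits))
    print("empirical frequencies:", empirical_frequencies(logits, 20000))
```

With enough samples, the empirical frequencies should approach the softmax of the logits. The argmax itself is not differentiable, which is why training commonly falls back on the differentiable surrogates mentioned above.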

approximate inference, combinatorial space, leveraging recursive gumbel-max trick

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Uncertainty (0.40)
  - Machine Learning > Neural Networks
    - Deep Learning (0.65)