Gradient Estimation with Stochastic Softmax Tricks Max B. Paulus
–Neural Information Processing Systems
The Gumbel-Max trick is the basis of many relaxed gradient estimators. These estimators are easy to implement and low variance, but the goal of scaling them comprehensively to large combinatorial distributions is still outstanding.
Neural Information Processing Systems
Oct-2-2025, 17:56:18 GMT