Gradient Estimation with Stochastic Softmax Tricks Max B. Paulus

Neural Information Processing Systems 

The Gumbel-Max trick is the basis of many relaxed gradient estimators. These estimators are easy to implement and low variance, but the goal of scaling them comprehensively to large combinatorial distributions is still outstanding.