Gradient Estimation with Stochastic Softmax Tricks

Max B. Paulus