Clarify Technical Contributions (R3 / R4): 2 Gradient Estimation
–Neural Information Processing Systems
We thank all reviewers for their detailed constructive feedback and suggestions. Table B (below) demonstrates this empirically. Gumbel-Softmax has) with significantly less training time and resource consumption. These experiments show that when trained with Gumbel-CRF, the AR decoder outperforms REINFORCE. We will clarify this in the paper.
Neural Information Processing Systems
Aug-17-2025, 02:45:44 GMT
- Technology: