We thank our reviewers for their time and valuable comments
–Neural Information Processing Systems
We thank our reviewers for their time and valuable comments. We have observed in the literature and also from personal communication at recent conferences incl. We feel this paper will have a significant impact, by showing that stable training can be obtained with REINFORCE. We agree with your point that we are dismissing non-autoregressive language models. We have addressed these typos, thank you for noting them!
Neural Information Processing Systems
Nov-20-2025, 17:47:54 GMT