90fd4f88f588ae64038134f1eeaa023f-AuthorFeedback.pdf
Neural Information Processing Systems
Thank you for all the helpful comments. The reviewers raised several related works, which we discuss here. We note that the authors have marked their arXiv submission as containing errors. Each of their inner loops uses SGD to solve the distance-regularized objectives. First, we use the exponential moving average (EMA) of slow weights to adjust the training parameters during optimization.
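To make the EMA-of-slow-weights idea concrete, the following is a minimal sketch, not our actual implementation. It assumes the slow weights are updated by interpolating toward the fast weights produced by a short SGD inner loop, which gives an exponential moving average of the inner-loop endpoints. The names `sgd_inner_loop`, `ema_update`, `alpha`, and `steps`, as well as the toy quadratic loss, are hypothetical and chosen only for illustration. The inner loop here is plain SGD and does not include any distance regularization.

```python
def sgd_inner_loop(weights, grad_fn, lr, steps):
    """Run `steps` plain SGD updates starting from `weights` (hypothetical helper)."""
    fast = list(weights)
    for _ in range(steps):
        grads = grad_fn(fast)
        fast = [w - lr * g for w, g in zip(fast, grads)]
    return fast


def ema_update(slow, fast, alpha):
    """Move slow weights a fraction `alpha` toward the fast weights.

    Equivalent to slow = (1 - alpha) * slow + alpha * fast, so the slow
    weights form an exponential moving average of the inner-loop endpoints.
    """
    return [s + alpha * (f - s) for s, f in zip(slow, fast)]


def quadratic_grad(w):
    """Gradient of the toy loss sum(x^2), used only for illustration."""
    return [2.0 * x for x in w]


# Assumed toy setup: 20 outer steps, each with 5 inner SGD steps.
slow = [1.0, -2.0]
for _ in range(20):
    fast = sgd_inner_loop(slow, quadratic_grad, lr=0.1, steps=5)
    slow = ema_update(slow, fast, alpha=0.5)
```

In this sketch each inner loop restarts from the current slow weights, so `alpha` controls how much of the inner-loop progress is kept at each outer step.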
Oct-3-2025, 05:38:30 GMT