Export Reviews, Discussions, Author Feedback and Meta-Reviews

Feb-7-2025, 17:05:36 GMT–Neural Information Processing Systems

This paper is an essentially theoretical contribution regarding convergence rates for the so-called "Hogwild"-style algorithms for stochastic gradient descent. In these algorithms, the gradient step is produces asynchronously over different chunks of the dataset in parallel, with results updating current weights as they are completed, independent of other parallel updates. Previously, demonstrating theoretical convergence has been difficult and somewhat brittle. They show that one of their proven variants, "Buckwild" provides significant real-world speedups by using lower precision arithmetic to compute the gradient steps. As far as the paper goes, it is generally good. I had little trouble reading and understanding the paper (I think), and they make a point to explain the maths in an intuitive fashion, insofar as it is possible.

algorithm, author feedback and meta-review, export review, (8 more...)

Neural Information Processing Systems

Feb-7-2025, 17:05:36 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.56)