Taming the Wild: A Unified Analysis of Hogwild!-Style Algorithms

Mar-13-2024, 01:45:57 GMT–Neural Information Processing Systems

Stochastic gradient descent (SGD) is a ubiquitous algorithm for a variety of machine learning problems. Researchers and industry have developed several techniques to optimize SGD's runtime performance, including asynchronous execution and reduced precision. Our main result is a martingale-based analysis that enables us to capture the rich noise models that may arise from such techniques.
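To make the "reduced precision" setting concrete, here is a minimal sketch of SGD in which the model weight is stored on a coarse fixed-point grid and every update is rounded back onto that grid with unbiased stochastic rounding. It is an illustration under stated assumptions, not the algorithm or analysis from the paper: the one-dimensional least-squares problem, the grid spacing of 1/256, the learning rate, and the function names `stochastic_round` and `low_precision_sgd` are all hypothetical choices made for this example, and asynchronous execution is not modeled.

```python
import math
import random


def stochastic_round(x, step, rng):
    """Round x to a multiple of `step`, up or down at random.

    The probability of rounding up equals the fractional position of x
    between the two neighboring grid points, so the result is unbiased:
    its expected value equals x.
    """
    scaled = x / step
    lower = math.floor(scaled)
    frac = scaled - lower
    return (lower + (1 if rng.random() < frac else 0)) * step


def low_precision_sgd(num_steps=2000, lr=0.1, step=1.0 / 256, true_w=3.0, seed=0):
    """Run SGD on a noise-free 1-D least-squares problem with a quantized weight.

    Each sample is (x, y) with y = true_w * x, and the loss for one sample is
    0.5 * (w * x - y) ** 2. All parameter values are illustrative assumptions.
    """
    rng = random.Random(seed)
    w = 0.0  # starts on the grid
    for _ in range(num_steps):
        x = rng.uniform(-1.0, 1.0)
        y = true_w * x
        grad = (w * x - y) * x  # gradient of the per-sample loss w.r.t. w
        # The rounding step is the extra noise source introduced by low precision.
        w = stochastic_round(w - lr * grad, step, rng)
    return w


final_w = low_precision_sgd()
print(f"final weight: {final_w}")
```

Because the rounding is unbiased, the quantized update equals the exact SGD update in expectation, which is the kind of property that lets rounding error be treated as one more zero-mean noise term alongside the usual sampling noise.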

algorithm, artificial intelligence, machine learning

Country:
- North America > United States (0.28)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.70)