Towards Exact Gradient-based Training on Analog In-memory Computing

May-26-2025, 22:43:48 GMT–Neural Information Processing Systems

Given the high economic and environmental costs of using large vision or language models, analog in-memory accelerators present a promising solution for energy-efficient AI. While inference on analog accelerators has been studied recently, the training perspective is underexplored. Recent studies have shown that the "workhorse" of digital AI training - stochastic gradient descent (SGD) algorithm converges inexactly when applied to model training on non-ideal devices. This paper puts forth a theoretical foundation for gradient-based training on analog devices. We begin by characterizing the non-convergent issue of SGD, which is caused by the asymmetric updates on the analog devices.

artificial intelligence, exact gradient-based training, machine learning, (4 more...)

Neural Information Processing Systems

May-26-2025, 22:43:48 GMT

Conferences Web Page

Add feedback

Genre:
- Research Report (0.63)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.63)