MKOR: Momentum-Enabled Kronecker-Factor-Based Optimizer Using Rank-1 Updates

Neural Information Processing Systems 

This work proposes MKOR, a Momentum-Enabled Kronecker-Factor-Based Optimizer Using Rank-1 Updates, which reduces the training time and improves the convergence properties of deep neural networks (DNNs). Second-order techniques, while enjoying higher convergence rates than their first-order counterparts, have cubic complexity with respect to the model size and/or the training batch size.
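To make the cubic-complexity claim concrete, below is a minimal sketch of a classical Newton-type second-order update; the notation ($\theta_t$, $H_t$, $d$, $\eta$) is illustrative and not taken from the paper.

```latex
% Newton-type second-order update for a model with d parameters:
% the curvature matrix H_t (Hessian or Fisher) is d x d, so forming
% its inverse costs O(d^3) time and O(d^2) memory.
\[
  \theta_{t+1} = \theta_t - \eta\, H_t^{-1} \nabla_\theta \mathcal{L}(\theta_t),
  \qquad H_t \in \mathbb{R}^{d \times d},
  \qquad \text{cost of } H_t^{-1} \in \mathcal{O}(d^3).
\]
```

Kronecker-factored approximations sidestep this cost by replacing $H_t$ with per-layer Kronecker products of much smaller factors, the structure that MKOR builds on with its rank-1 updates.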