Reviews: Training Deep Networks without Learning Rates Through Coin Betting

Oct-8-2024, 01:43:36 GMT–Neural Information Processing Systems

Summary: This paper is based on the notion (established in existing works) that subgradient descent can be interpreted as betting on a coin. It generalizes existing results to data-dependent bets and, consequently, data-dependent convergence bounds that improve upon the existing ones. The algorithm is then adapted for the training of neural networks and evaluated on this task experimentally. Quality: The derivations are mathematically sound. I verified the proof of Theorem 1. The changes made to adapt COCOB for the training of neural networks (Algo 1 -- Algo 2) are sensible and intuitive.

algorithm, experimental evaluation, training deep network, (12 more...)

Neural Information Processing Systems

Oct-8-2024, 01:43:36 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.44)