Reviews: Convergent Block Coordinate Descent for Training Tikhonov Regularized Deep Neural Networks

Oct-9-2024, 03:03:55 GMT – Neural Information Processing Systems

This paper proposes a simple and efficient block coordinate descent (BCD) algorithm with a novel Tikhonov regularization for training both dense and sparse DNNs with ReLU activations. The authors show that the proposed BCD algorithm converges globally to a stationary point with an R-linear convergence rate of order one, and that it performs better than all the SGD variants in the experiments. However, the motivation for using Tikhonov regularization and block coordinate descent is not clear. The technical parts are hard to follow because many details are missing. The presented results are far from state of the art. For these reasons, I am not sure whether the proposed method can be applied to real "DNNs".

convergent block coordinate descent, tikhonov regularization, tikhonov regularized deep neural network, (6 more...)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.52)