Implicit Bias of Gradient Descent on Linear Convolutional Networks

Gunasekar, Suriya, Lee, Jason, Soudry, Daniel, Srebro, Nathan

Jun-1-2018–arXiv.org Machine Learning

We show that gradient descent on full-width linear convolutional networks of depth $L$ converges to a linear predictor related to the $\ell_{2/L}$ bridge penalty in the frequency domain. This contrasts with fully connected linear networks, where gradient descent converges to the hard-margin linear support vector machine solution regardless of depth.
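To make the $\ell_{2/L}$ bridge penalty in the frequency domain concrete, here is a minimal sketch that evaluates it for a given linear predictor. The function name `bridge_penalty_freq`, the unitary DFT normalization, and the choice to return the plain sum of powers (rather than its $L/2$-th power) are assumptions made for illustration; they are not taken from the paper.

```python
import numpy as np

def bridge_penalty_freq(beta, depth):
    """Sum over frequencies of |beta_hat_k| ** (2 / depth).

    beta_hat is the unitary discrete Fourier transform of beta
    (the 1/sqrt(D) scaling is an assumed convention).
    """
    beta = np.asarray(beta, dtype=float)
    beta_hat = np.fft.fft(beta) / np.sqrt(beta.size)
    return float(np.sum(np.abs(beta_hat) ** (2.0 / depth)))

# Two hypothetical predictors with the same Euclidean norm (equal to 1).
D = 8
spike = np.zeros(D)
spike[0] = 1.0                 # flat spectrum: every |beta_hat_k| is 1/sqrt(D)
wave = np.ones(D) / np.sqrt(D) # spectrum concentrated on a single frequency

for depth in (1, 2, 3):
    print(depth,
          bridge_penalty_freq(spike, depth),
          bridge_penalty_freq(wave, depth))
```

With `depth=1` the exponent is 2, so by Parseval's identity the penalty equals the squared Euclidean norm and the two predictors tie. For `depth >= 2` the exponent drops to 1 or below, and the predictor whose spectrum is concentrated on one frequency gets the smaller penalty, which illustrates why this kind of penalty favors predictors that are sparse in the frequency domain.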

artificial intelligence, convolutional network, machine learning

Country:
- North America > United States (0.28)

Genre:
- Research Report (0.64)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning
  - Gradient Descent (0.85)
  - Support Vector Machines (0.55)
