Learning a Single Neuron with Bias Using Gradient Descent Gal Vardi

Neural Information Processing Systems 

In this work, we study the common setting of learning a single neuron with respect to the squared loss, using gradient descent.