Why Does Deep Learning Not Have a Local Minimum?
