Local minima in training of neural networks