On the geometry of solutions and on the capacity of multi-layer neural networks with ReLU activations