We have read and appreciate all comments, due to page limit we will only address a subset of questions/concerns

Neural Information Processing Systems 

We have read and appreciate all comments, due to page limit we will only address a subset of questions/concerns. We do not expect this will significantly change our performance. R1: Can you evaluate the coarseness of the approximation? By sampling 1000 random singular values with an L2 norm < 50 we get the following results. R4: Approximation = loss not necessarily convex: This is true.