Reviews: A Non-convex One-Pass Framework for Generalized Factorization Machine and Rank-One Matrix Sensing

Neural Information Processing Systems 

Major comments
--------------

* An obvious major issue with this paper is the lack of experiments. How does the algorithm compare to gradient-based local search algorithms? I would be curious to see if it works better on i) Gaussian distributed data and ii) real data. Even if the results turn out to be similar, perhaps the authors can find some advantages to their algorithm such as, e.g., robustness to initialization.

* Recent works [1, 2] have called this variant "polynomial network" so the authors might want to use this name instead of "factorization machine".