some of them (like what network structure or loss function tend to cause AVs, and what other new theoretical results
Neural Information Processing Systems
First of all, we would like to thank all reviewers for their insightful comments and suggestions! Optimization landscape analysis is an important research topic in deep learning. What can be explained by AVs but not symmetric valleys (SVs)? This seemingly contradictory observation can be well explained by AVs, but not by SVs. To be conservative, we used the phrase "decent probability" in our paper.
Nov-15-2025, 18:48:13 GMT