8abfe8ac9ec214d68541fcb888c0b4c3-Paper.pdf
–Neural Information Processing Systems
More specifically, in our main result (Theorem 3.2), we identify a set of sufficient conditions on the initialization and the network topology under which the global convergence of gradient descent is obtained.
Feb-9-2026, 07:16:10 GMT