Online ICA: Understanding Global Dynamics of Nonconvex Optimization via Diffusion Processes

Jan-20-2025, 13:05:14 GMT–Neural Information Processing Systems

Solving statistical learning problems often involves nonconvex optimization. Despite the empirical success of nonconvex statistical optimization methods, their global dynamics, especially convergence to the desirable local minima, remain less well understood in theory. In this paper, we propose a new analytic paradigm based on diffusion processes to characterize the global dynamics of nonconvex statistical optimization. As a concrete example, we study stochastic gradient descent (SGD) for the tensor decomposition formulation of independent component analysis. In particular, we cast different phases of SGD into diffusion processes, i.e., solutions to stochastic differential equations.

artificial intelligence, arxiv preprint arxiv, machine learning, (14 more...)

Neural Information Processing Systems

Jan-20-2025, 13:05:14 GMT

Conferences PDF

Add feedback

Country:
- Europe > Spain (0.14)

Industry:
- Education (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Statistical Learning
    - Gradient Descent (0.57)
  - Representation & Reasoning > Optimization (1.00)