Geometry of Critical Sets and Existence of Saddle Branches for Two-layer Neural Networks
Zhang, Leyang, Zhang, Yaoyu, Luo, Tao
–arXiv.org Artificial Intelligence
This paper presents a comprehensive analysis of critical point sets in two-layer neural networks. To study such complex entities, we introduce the critical embedding operator and critical reduction operator as our tools. Given a critical point, we use these operators to uncover the whole underlying critical set representing the same output function, which exhibits a hierarchical structure. Furthermore, we prove existence of saddle branches for any critical set whose output function can be represented by a narrower network. Our results provide a solid foundation to the further study of optimization and training behavior of neural networks.
arXiv.org Artificial Intelligence
May-25-2024
- Country:
- Asia > China
- Europe
- Switzerland (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Genre:
- Research Report (0.70)
- Technology: