Learning on a Razor's Edge: the Singularity Bias of Polynomial Neural Networks
Shahverdi, Vahid, Marchetti, Giovanni Luca, Kohn, Kathlén
arXiv.org Artificial Intelligence
In this work, we theoretically analyze subnetworks and their bias through the lens of algebraic geometry. We consider fully-connected networks with polynomial activation functions and focus on the geometry of the function space they parametrize, often referred to as the neuromanifold. First, we compute the dimension of the subspace of the neuromanifold parametrized by subnetworks. Second, we show that this subspace is singular. Third, we argue that such singularities often correspond to critical points of the training dynamics. Lastly, we discuss convolutional networks, for which subnetworks and singularities are similarly related, but the bias does not arise.
Figure 1: Subnetworks define singular points (orange) of the neuromanifold.
May-20-2025
- Country:
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Asia
- Middle East > Israel (0.04)
- Japan > Honshū
- Tōhoku > Fukushima Prefecture > Fukushima (0.04)
- Genre:
- Research Report (0.64)
- Technology: