On the Geometry and Optimization of Polynomial Convolutional Networks
Vahid Shahverdi, Giovanni Luca Marchetti, Kathlén Kohn
arXiv.org Artificial Intelligence
We study convolutional neural networks with monomial activation functions. Specifically, we prove that their parameterization map is regular and is an isomorphism almost everywhere, up to rescaling the filters. Using tools from algebraic geometry, we explore the geometric properties of the image of this map in function space, typically referred to as the neuromanifold. In particular, we compute the dimension and the degree of the neuromanifold, which measure the expressivity of the model, and describe its singularities. Moreover, for a generic large dataset, we derive an explicit formula for the number of critical points arising in the optimization of a regression loss.
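The phrase "up to rescaling the filters" can be illustrated numerically. The following is a minimal sketch, not the authors' construction: it assumes a two-layer 1D network with stride 1, no bias, the monomial activation t -> t**r applied between layers only, and hypothetical helper names `conv1d` and `poly_cnn`. Under these assumptions, multiplying the first filter by c and dividing the second by c**r leaves the network function unchanged, so the two parameter settings map to the same point of the neuromanifold.

```python
import numpy as np

def conv1d(x, w):
    """Valid 1D cross-correlation of signal x with filter w (stride 1, no bias)."""
    k = len(w)
    return np.array([np.dot(w, x[i:i + k]) for i in range(len(x) - k + 1)])

def poly_cnn(x, filters, r):
    """Convolutional layers with monomial activation t -> t**r between layers.

    Assumption for this sketch: no activation after the last layer.
    """
    h = x
    for w in filters[:-1]:
        h = conv1d(h, w) ** r
    return conv1d(h, filters[-1])

rng = np.random.default_rng(0)
x = rng.standard_normal(8)   # input signal
w1 = rng.standard_normal(3)  # first-layer filter
w2 = rng.standard_normal(2)  # second-layer filter
r = 2                        # activation degree
c = 1.7                      # arbitrary nonzero rescaling factor

y = poly_cnn(x, [w1, w2], r)
# Rescale: the factor c**r produced by the activation is cancelled in layer two.
y_rescaled = poly_cnn(x, [c * w1, w2 / c**r], r)
same_output = np.allclose(y, y_rescaled)
```

The cancellation holds because (c * t)**r equals c**r * t**r and the last layer is linear. This is the kind of parameter symmetry that the abstract's isomorphism statement is taken modulo.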
Oct-1-2024
- Country:
  - Europe
    - United Kingdom > England > Cambridgeshire > Cambridge (0.04)
    - Sweden > Stockholm > Stockholm (0.04)
  - Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)
- Genre:
  - Research Report (0.64)
- Technology: