Learning Multi-Index Models with Hyper-Kernel Ridge Regression
Huang, Shuo, Labarrière, Hippolyte, De Vito, Ernesto, Poggio, Tomaso, Rosasco, Lorenzo
Deep neural networks excel in high-dimensional problems, outperforming models such as kernel methods, which suffer from the curse of dimensionality. However, the theoretical foundations of this success remain poorly understood. We follow the idea that the compositional structure of the learning task is the key factor determining when deep networks outperform other approaches. Taking a step towards formalizing this idea, we consider a simple compositional model, namely the multi-index model (MIM). In this context, we introduce and study hyper-kernel ridge regression (HKRR), an approach blending neural networks and kernel methods. Our main contribution is a sample complexity result demonstrating that HKRR can adaptively learn MIM, overcoming the curse of dimensionality. Further, we exploit the kernel nature of the estimator to develop ad hoc optimization approaches. Indeed, we contrast alternating minimization and alternating gradient methods both theoretically and numerically. These numerical results complement and reinforce our theoretical findings.
Oct-6-2025
- Country:
- North America > United States
- Massachusetts > Middlesex County > Cambridge (0.14)
- Europe
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Italy > Liguria
- Genoa (0.04)
- United Kingdom > England
- Asia
- Middle East > Jordan (0.04)
- Japan > Honshū
- Tōhoku > Fukushima Prefecture > Fukushima (0.04)
- Africa > Senegal
- Kolda Region > Kolda (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (0.87)
- Industry:
- Government > Regional Government (0.46)
- Technology: