Finite Sample Identification of Wide Shallow Neural Networks with Biases
Fornasier, Massimo, Klock, Timo, Mondelli, Marco, Rauchensteiner, Michael
–arXiv.org Artificial Intelligence
Artificial neural networks are functions depending on a finite number of parameters typically encoded as weights and biases. The identification of the parameters of the network from finite samples of input-output pairs is often referred to as the \emph{teacher-student model}, and this model has represented a popular framework for understanding training and generalization. Even if the problem is NP-complete in the worst case, a rapidly growing literature -- after adding suitable distributional assumptions -- has established finite sample identification of two-layer networks with a number of neurons $m=\mathcal O(D)$, $D$ being the input dimension. For the range $D
arXiv.org Artificial Intelligence
Nov-8-2022
- Country:
- North America > United States
- Maryland (0.04)
- Europe
- Austria (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Norway > Eastern Norway
- Oslo (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (0.65)
- Industry:
- Education (0.48)
- Technology: