Injectivity of ReLU networks: perspectives from statistical physics
Maillard, Antoine, Bandeira, Afonso S., Belius, David, Dokmanić, Ivan, Nakajima, Shuta
When can the input of a ReLU neural network be inferred from its output? In other words, when is the network injective? We consider a single layer, $x \mapsto \mathrm{ReLU}(Wx)$, with a random Gaussian $m \times n$ matrix $W$, in a high-dimensional setting where $n, m \to \infty$ at a fixed aspect ratio $\alpha = \frac{m}{n}$. Recent work connects this problem to spherical integral geometry, giving rise to a conjectured sharp injectivity threshold for $\alpha$ obtained by studying the expected Euler characteristic of a certain random set. We adopt a different perspective and show that injectivity is equivalent to a property of the ground state of the spherical perceptron, an important spin-glass model in statistical physics. By leveraging the (non-rigorous) replica symmetry-breaking theory, we derive analytical equations for the threshold whose solution is at odds with that from the Euler characteristic. Furthermore, we use Gordon's min--max theorem to prove that a replica-symmetric upper bound refutes the Euler characteristic prediction. Along the way, we give a tutorial-style introduction to key ideas from statistical physics, aiming to make the exposition accessible to a broad audience. Our analysis establishes a connection between spin glasses and integral geometry but leaves open the problem of explaining the discrepancies.
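As a concrete illustration (not part of the paper), the following Python sketch probes a provably necessary condition for injectivity: for every direction $x$, the rows of $W$ that are weakly active on $x$, i.e. those with $\langle w_i, x \rangle \ge 0$, must have full rank $n$; otherwise $x$ can be perturbed inside their common kernel without changing $\mathrm{ReLU}(Wx)$. All function names and parameter choices below are illustrative.

```python
import numpy as np

def violates_at(W, x):
    """Test the necessary injectivity condition at direction x: the rows of W
    that are weakly active on x (<w_i, x> >= 0) must have rank n. If they do
    not, perturbing x inside their common kernel leaves ReLU(W x) unchanged,
    certifying non-injectivity."""
    n = W.shape[1]
    active = W[W @ x >= 0.0]
    if active.shape[0] < n:  # fewer than n active rows can never have rank n
        return True
    return np.linalg.matrix_rank(active) < n

def injectivity_probe(alpha, n=100, trials=2000, seed=0):
    """Monte Carlo probe for W ~ N(0,1)^{m x n} with m = alpha * n: returns
    the fraction of random directions at which non-injectivity is certified."""
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((int(alpha * n), n))
    xs = rng.standard_normal((trials, n))
    return sum(violates_at(W, x) for x in xs) / trials

if __name__ == "__main__":
    for alpha in (1.5, 3.0, 6.0, 12.0):
        print(f"alpha = {alpha:5.1f}   certified-failure rate = "
              f"{injectivity_probe(alpha):.3f}")
```

Note that a random direction rarely witnesses a violation once $\alpha$ is moderately large: near the conjectured threshold the offending directions occupy an exponentially small fraction of the sphere, which is why the paper turns to the Euler characteristic and spin-glass machinery rather than sampling.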
arXiv.org Artificial Intelligence
Feb-27-2023