Entropy and mutual information in models of deep neural networks

Mar-16-2026, 21:56:06 GMT–Neural Information Processing Systems

We examine a class of stochastic deep learning models with a tractable method to compute information-theoretic quantities. Our contributions are three-fold: (i) We show how entropies and mutual informations can be derived from heuristic statistical physics methods, under the assumption that weight matrices are independent and orthogonally invariant. (ii) We extend particular cases in which this result is known to be rigorously exact by providing a proof for two-layer networks with Gaussian random weights, using the recently introduced adaptive interpolation method.

artificial intelligence, machine learning, proceedings

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.82)