Exponential Separations in Symmetric Neural Networks

Jan-19-2025, 00:33:26 GMT–Neural Information Processing Systems

In this work we demonstrate a novel separation between symmetric neural network architectures. Specifically, we consider the Relational Network \parencite{santoro2017simple} architecture as a natural generalization of the DeepSets \parencite{zaheer2017deep} architecture, and study their representational gap. Under the restriction to analytic activation functions, we construct a symmetric function acting on sets of size N with elements in dimension D, which can be efficiently approximated by the former architecture, but provably requires width exponential in N and D for the latter.

architecture, exponential separation, symmetric neural network, (1 more...)

Neural Information Processing Systems

Jan-19-2025, 00:33:26 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)