Optimization Dynamics of Equivariant and Augmented Neural Networks

Sep-21-2023–arXiv.org Artificial Intelligence

We investigate the optimization of multilayer perceptrons on symmetric data. We compare the strategy of constraining the architecture to be equivariant to that of using augmentation. We show that, under natural assumptions on the loss and non-linearities, the sets of equivariant stationary points are identical for the two strategies, and that the set of equivariant layers is invariant under the gradient flow for augmented models. Finally, we show that stationary points may be unstable for augmented training although they are stable for the equivariant models.
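The claim that equivariant layers are invariant under the gradient flow of the augmented model can be checked numerically in a much simpler setting than the one the abstract addresses. The following is a minimal sketch, assuming a single linear layer, a squared loss, and the cyclic group acting on vectors by shifts; none of these choices come from the paper, and all function names (`shift`, `circulant`, `augmented_grad`, `equivariance_defect`) are hypothetical. In this toy case the equivariant layers are exactly the circulant matrices, and the gradient of the augmented loss at a circulant matrix should again be circulant, so a gradient step does not leave the equivariant set.

```python
import random

def shift(v, k):
    """Cyclic shift by k positions: (rho(k) v)[i] = v[(i - k) % n]."""
    n = len(v)
    return [v[(i - k) % n] for i in range(n)]

def matvec(W, v):
    """Plain matrix-vector product for a list-of-rows matrix."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]

def circulant(c):
    """Circulant matrix W[i][j] = c[(i - j) % n]; it commutes with every cyclic shift."""
    n = len(c)
    return [[c[(i - j) % n] for j in range(n)] for i in range(n)]

def augmented_grad(W, data):
    """Gradient of sum over data of ||W x - y||^2, averaged over all cyclic shifts of (x, y)."""
    n = len(W)
    G = [[0.0] * n for _ in range(n)]
    for x, y in data:
        for k in range(n):  # full augmentation: every group element is applied
            gx, gy = shift(x, k), shift(y, k)
            r = [p - t for p, t in zip(matvec(W, gx), gy)]  # residual W gx - gy
            for i in range(n):
                for j in range(n):
                    G[i][j] += 2.0 * r[i] * gx[j] / n
    return G

def equivariance_defect(M):
    """Largest |M[i][j] - M[i+1][j+1]| (indices mod n); zero exactly when M is circulant."""
    n = len(M)
    return max(abs(M[i][j] - M[(i + 1) % n][(j + 1) % n])
               for i in range(n) for j in range(n))

if __name__ == "__main__":
    random.seed(0)
    n = 4
    # Assumed toy data: a few random (input, target) pairs, not from the paper.
    data = [([random.gauss(0, 1) for _ in range(n)],
             [random.gauss(0, 1) for _ in range(n)]) for _ in range(5)]
    W_eq = circulant([random.gauss(0, 1) for _ in range(n)])
    W_gen = [[random.gauss(0, 1) for _ in range(n)] for _ in range(n)]
    print("defect of gradient at equivariant W:", equivariance_defect(augmented_grad(W_eq, data)))
    print("defect of gradient at generic W:   ", equivariance_defect(augmented_grad(W_gen, data)))
```

The first printed defect should be zero up to floating-point rounding, while the second is generally not. This illustrates only the invariance statement; it says nothing about the stability of stationary points, which is where the abstract reports that the two training strategies can differ.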

Country:
- North America > United States
  - Virginia (0.04)
- Europe > Sweden
  - Västerbotten County > Umeå (0.04)

Genre:
- Research Report > New Finding (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.54)
