Controlling Grokking with Nonlinearity and Data Symmetry
