Network Degeneracy as an Indicator of Training Performance: Comparing Finite and Infinite Width Angle Predictions

Open in new window