Phase diagram of early training dynamics in deep networks: effect of the learning rate, depth, and width

Neural Information Processing Systems 

Notably, we discover the opening up of a "sharpness reduction" phase,
