A Supplementary Material A.1 Additional Literature Review

Neural Information Processing Systems 

Batch Normalization, Softmax, and Other Layers The current formulation of our improved Lipschitz estimation algorithm is applicable to architectures consisting of convolutional, fully connected, and slope-restricted activation layers.