Training BatchNorm and Only BatchNorm: On the Expressive Power of Random Features in CNNs

Open in new window