Towards strong pruning for lottery tickets with non-zero biases
Fischer, Jonas, Burkholz, Rebekka
–arXiv.org Artificial Intelligence
The strong lottery ticket hypothesis holds the promise that pruning randomly initialized deep neural networks could offer a computationally efficient alternative to deep learning with stochastic gradient descent. Common parameter initialization schemes and existence proofs, however, are focused on networks with zero biases, thus foregoing the potential universal approximation property of pruning. To fill this gap, we extend multiple initialization schemes and existence proofs to non-zero biases, including explicit'looks-linear' approaches for ReLU activation functions. These do not only enable truly orthogonal parameter initialization but also reduce potential pruning errors. In experiments on standard benchmark data sets, we further highlight the practical benefits of non-zero bias initialization schemes, and present theoretically inspired extensions for state-of-the-art strong lottery ticket pruning. Challenging tasks across different domains, from protein structure prediction for drug development to detection in complex scenes for self driving cars, have recently been solved through deep neural networks (NNs).
arXiv.org Artificial Intelligence
Oct-21-2021
- Genre:
- Contests & Prizes (1.00)
- Industry:
- Health & Medicine (0.68)
- Leisure & Entertainment > Gambling (0.96)
- Technology: