Review for NeurIPS paper: Optimal Lottery Tickets via Subset Sum: Logarithmic Over-Parameterization is Sufficient
Neural Information Processing Systems
Summary and Contributions: This paper considers the strong lottery ticket hypothesis, the conjecture that a randomly initialized neural network can be pruned (i.e., have a subset of its weights removed, with no further training) so that the remaining subnetwork approximates a given target network. The authors show that when the target function is a fully connected neural network, such a pruning exists with high probability whenever the randomly initialized network has twice as many layers and a width that is a log(d*l/epsilon) factor larger than the target network, where d is the width of the target network, l is its depth, and epsilon is the desired accuracy. This improves on the best (and only) known result on this problem (Malach et al., 2020), which showed that the same guarantee can be achieved with a width that is a poly(d, l, 1/epsilon) factor larger than the target. The improvement is achieved by essentially reusing the proof of Malach et al., but fixing a key step where polynomial factors were lost, by appealing to known results on the random subset sum problem.
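The random subset sum phenomenon driving the improvement can be illustrated numerically: with only on the order of log(1/epsilon) i.i.d. uniform samples on [-1, 1], every target value in [-1, 1] is, with high probability, within epsilon of some subset sum. A minimal sketch (the sample count, seed, and target values are illustrative choices, not taken from the paper):

```python
import itertools
import random

def best_subset_sum_error(samples, target):
    """Smallest |sum(S) - target| over all subsets S of samples
    (the empty subset, with sum 0, is included)."""
    best = abs(target)
    for r in range(1, len(samples) + 1):
        for combo in itertools.combinations(samples, r):
            best = min(best, abs(sum(combo) - target))
    return best

random.seed(0)
n = 16  # roughly C * log(1/epsilon) samples suffice for small epsilon
samples = [random.uniform(-1, 1) for _ in range(n)]

# Every target in [-1, 1] should be well approximated by some subset sum.
targets = (-0.7, 0.1, 0.9)
errors = [best_subset_sum_error(samples, z) for z in targets]
print(max(errors))
```

With 16 samples there are 2^16 subset sums spread over a constant-length interval, so typical approximation errors are on the order of 2^(-n); this exponential-in-n accuracy is exactly what replaces the polynomial over-parameterization in Malach et al.'s argument.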
Jan-22-2025, 06:24:42 GMT