Advancing Model Pruning via Bi-level Optimization

Neural Information Processing Systems 

As illustrated by the Lottery Ticket Hypothesis (LTH), pruning also has the potential to improve the generalization ability of the pruned model. At the core of LTH, iterative magnitude pruning (IMP) is the predominant pruning method for successfully finding 'winning tickets'. Yet, the computation cost of IMP grows prohibitively as the targeted pruning ratio increases.
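To make the cost claim concrete, the following minimal sketch counts how many prune-and-retrain rounds an IMP-style schedule would need to reach a target sparsity. It assumes that each round removes a fixed fraction of the weights that are still unpruned and is followed by one retraining run. The function name `imp_rounds` and the 20% per-round rate are illustrative assumptions, not values taken from this text.

```python
def imp_rounds(target_sparsity: float, rate_per_round: float = 0.2) -> int:
    """Count prune-and-retrain rounds needed to reach target_sparsity.

    Assumption: every round prunes `rate_per_round` of the weights that
    are still unpruned, so the remaining fraction shrinks geometrically.
    """
    if not 0.0 <= target_sparsity < 1.0:
        raise ValueError("target_sparsity must be in [0, 1)")
    if not 0.0 < rate_per_round < 1.0:
        raise ValueError("rate_per_round must be in (0, 1)")

    remaining = 1.0  # fraction of weights still unpruned
    rounds = 0
    while 1.0 - remaining < target_sparsity:
        remaining *= 1.0 - rate_per_round  # prune a share of what is left
        rounds += 1                        # each round implies one retraining run
    return rounds


if __name__ == "__main__":
    for sparsity in (0.5, 0.9, 0.99):
        print(f"target sparsity {sparsity:.2f}: {imp_rounds(sparsity)} rounds")
```

Under these assumptions, 50% sparsity takes 4 rounds, 90% takes 11, and 99% takes 21. Since every round carries its own retraining run, the total training cost scales with the round count, which is consistent with the observation that IMP becomes expensive at high pruning ratios.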