RareGems: FindingLotteryTicketsatInitialization

Neural Information Processing Systems 

A large body of research since the 1980s empirically observed that large neural networks can be compressed orsparsified toasmallfraction oftheiroriginal sizewhilemaintaining theirpredictive accuracy[14-16,20,23,29,45].