DiscoveringSparsityAllocationforLayer-wise PruningofLargeLanguageModels