Adaptive Layer Sparsity for Large Language Models via Activation Correlation Assessment Wei Li

Open in new window