Adaptive Layer Sparsity for Large Language Models via Activation Correlation Assessment

Open in new window