Layer as Puzzle Pieces: Compressing Large Language Models through Layer Concatenation
