The Efficiency Spectrum of Large Language Models: An Algorithmic Survey
