What are small language models and how do they differ from large ones?

Open in new window