Merino: Entropy-driven Design for Generative Language Models on IoT Devices
