Jupiter: Fast and Resource-Efficient Collaborative Inference of Generative LLMs on Edge Devices
