LiveMind: Low-latency Large Language Models with Simultaneous Inference

Open in new window