Query-OPT: Optimizing Inference of Large Language Models via Multi-Query Instructions in Meeting Summarization

Open in new window