SCALM: Towards Semantic Caching for Automated Chat Services with Large Language Models
