A Generative Caching System for Large Language Models