SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference

Open in new window