Beluga: A CXL-Based Memory Architecture for Scalable and Efficient LLM KVCache Management
