ClusterFusion: Expanding Operator Fusion Scope for LLMInference via Cluster-Level Collective Primitive

Open in new window