SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training Cong Xie

Open in new window