SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training