Memory Efficient Optimizers with 4-bit States

Open in new window