AdLoCo: adaptive batching significantly improves communications efficiency and convergence for Large Language Models

Open in new window