On the Optimal Batch Size for Byzantine-Robust Distributed Learning
