Debiasing Distributed Second Order Optimization with Surrogate Sketchingand Scaled Regularization

Neural Information Processing Systems 

Remark 3Toreachkxt x k2 kx0 x k2 weneedt log ( / )log ( q) iter thattheinputd (see B for adiscussion). AIDE: Fastand Communication Efficient Distributed Optimization.arXivpreprint,