88c3c482430a62d35e03926a22e4b67e-Supplemental-Conference.pdf

Neural Information Processing Systems 

CoLA and discuss modifications to improve lower precision performance. In Appendix D we expand on the details of the experiments in the main text. We now present the linear algebra identities that we use to exploit structure in CoLA. Finally, for sum we have the Woodbury identity and its variants. Besides the compositional operators, we have some rules for some special operators.