SUS backprop: linear backpropagation algorithm for long inputs in transformers

Open in new window