Memory-Efficient Backpropagation through Large Linear Layers

Open in new window