Appendix for Back Razor: Memory-Efficient Transfer Learning by Self-Sparsified Backpropagation