Nuclear Norm Regularization for Deep Learning
–Neural Information Processing Systems
Penalizing the nuclear norm of a function's Jacobian encourages it to locally behave like a low-rank linear map. Such functions vary locally along only a handful of directions, making the Jacobian nuclear norm a natural regularizer for machine learning problems. However, this regularizer is intractable for high-dimensional problems, as it requires computing a large Jacobian matrix and taking its SVD. We show how to efficiently penalize the Jacobian nuclear norm using techniques tailor-made for deep learning. We prove that for functions parametrized as compositions f g \circ h, one may equivalently penalize the average squared Frobenius norm of Jg and Jh .
Neural Information Processing Systems
May-27-2025, 17:50:58 GMT