Goto

Collaborating Authors

 Asia


Aligning Gradient and Hessian for Neural Signed Distance Function

Neural Information Processing Systems

Our motivation is grounded in a fundamental observation: aligning the gradient and the Hessian of the SDF provides a more efficient mechanism to govern gradient directions.






Understanding Transformer Predictions Through Memory Efficient Attention Manipulation

Neural Information Processing Systems

Most crucially, they require prohibitively large amounts of additional memory since they rely on backpropagation which allocates almost twice as much GPU memory as the forward pass. This renders it difficult, if not impossible, to use explanations in production.



Implicit variance regularization in non-contrastive SSL

Neural Information Processing Systems

In this work, we provide a comparative analysis of the learning dynamics for the Euclidean and cosine-based asymmetric losses in the eigenspace of the closed-form predictor DirectPred.