Goto

Collaborating Authors

 Asia


Transformers learn to implement preconditioned gradient descent for in-context learning

Neural Information Processing Systems

Several recent works demonstrate that transformers can implement algorithms like gradient descent. By a careful construction of weights, these works show that multiple layers of transformers are expressive enough to simulate iterations of gradient descent.




Anormativetheoryofsocialconflict

Neural Information Processing Systems

Social conflict is a survival mechanism yielding both normal and pathological behaviors. Tounderstand its underlying principles, we collected behavioral and whole-brain neural data from mice advancing through stages of social conflict.