Training the Untrainable: Introducing Inductive Bias via Representational Alignment

Jun-13-2026, 05:30:17 GMT–Neural Information Processing Systems

We demonstrate that architectures which traditionally are considered to be ill-suited for a task can be trained using inductive biases from another architecture. We call a network untrainable when it overfits, underfits, or converges to poor results even when tuning their hyperparameters. For example, fully connected networks overfit on object recognition while deep convolutional networks without residual connections underfit. The traditional answer is to change the architecture to impose some inductive bias, although the nature of that bias is unknown. We introduce guidance, where a guide network steers a target network using a neural distance function.

architecture, artificial intelligence, proceedings, (5 more...)

Neural Information Processing Systems

Jun-13-2026, 05:30:17 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence (0.59)