Training the Untrainable: Introducing Inductive Bias via Representational Alignment

Jun-19-2026, 13:20:24 GMT–Neural Information Processing Systems

We demonstrate that architectures which traditionally are considered to be ill-suited for a task can be trained using inductive biases from another architecture. We call a network untrainable when it overfits, underfits, or converges to poor results even when tuning their hyperparameters. For example, fully connected networks overfit on object recognition while deep convolutional networks without residual connections underfit. The traditional answer is to change the architecture to impose some inductive bias, although the nature of that bias is unknown. We introduce guidance, where a guide network steers a target network using a neural distance function.

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Jun-19-2026, 13:20:24 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.28)

Genre:
- Overview (0.92)
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Government (0.67)
- Health & Medicine > Therapeutic Area
  - Neurology (0.67)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (1.00)
  - Communications > Networks (1.00)
  - Artificial Intelligence
    - Vision (1.00)
    - Representation & Reasoning (1.00)
    - Cognitive Science (1.00)
    - Natural Language > Large Language Model (0.67)
    - Machine Learning
      - Neural Networks > Deep Learning (1.00)
      - Statistical Learning (0.92)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found